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Description 

[0001] Naturally occurring antibodies (immunoglobulins) comprise two heavy chains linked together by disulfide 
bonds and two light chains, one light chain being linked to each of the heavy chains by disulfide bonds. Each heavy 

5 chain has at one end a variable domain (V H ) followed by a number of constant domains. Each light chain has a variable 
domain (VJ at one end and a constant domain at its other end; the constant domain of the light chain is aligned with 
the first constant domain of the heavy chain, and the light chain variable domain is aligned with the variable domain of 
the heavy chain. Particular amino acid residues are believed to form an interface between the light and heavy chain var- 
iable domains, see e.g. Chothia etal, J. Mol Biol. 186:651-663 (1985); Novotny and Haber, Proc. Natl. Acad Sci USA 

10 82:4592-4596(1985). 

[0002] The constant domains are not involved directly in binding the antibody to an antigen, but are involved in various 
effector functions, such as participation of the antibody in antibody-dependent cellular cytotoxicity. The variable 
domains of each pair of light and heavy chains are involved directly in binding the antibody to the antigen. The domains 
of natural light and heavy chains have the same general structure, and each domain comprises four framework (FR) 

75 regions, whose sequences are somewhat conserved, connected by three hyper-variable or complementarity determin- 
ing regions (CDRs) (see Kabat, E. A. ef al, Sequences of Proteins of Immunological Interest, National institutes of 
Health, Bethesda, MD, (1987)). The four framework regions largely adopt a p-sheet conformation and the CDRs form 
loops connecting, and in some cases forming part of, the p-sheet structure. The CDRs in each chain are held in close 
proximity by the framework regions and, with the CDRs from the other chain, contribute to the formation of the antigen 

20 binding site. 

[0003] Widespread use has been made of monoclonal antibodies, particularly those derived from rodents including 
mice, however they are frequently antigenic in human clinical use. For example, a major limitation in the clinical use of 
rodent monoclonal antibodies is an anti-globulin response during therapy (Miller, R. A. etal, Blood 62:988-995 (1983); 
Schroff, R. W. etal., Cancer Res. 45:879-885 (1985)). 

25 [0004] The art has attempted to overcome this problem by constructing "chimeric" antibodies in which an animal anti- 
gen-binding variable domain is coupled to a human constant domain (Cabilly ef al, U.S. patent No. 4,816,567; Morri- 
son, S. L etal, Proc. Natl. Acad ScL USA 81:6851-6855 (1984); Boulianne, G. L etal., Nature 312:643-646 (1984); 
Neuberger, M. S. ef al., Nature 314:268-270 (1985)). The term "chimeric" antibody is used herein to describe a 
polypeptide comprising at least the antigen binding portion of an antibody molecule linked to at least part of another 

30 protein (typically an immunoglobulin constant domain). 

[0005] The isotype of the human constant domain may be selected to tailor the chimeric antibody for participation in 
antibody-dependent cellular cytotoxicity (ADCC) and complement-dependent cytotoxicity (see e.g. Bruggemann, M. ef 
al., J. Exp. Med. 166:1351-1361 (1987); Riechmann, L ef ai, Nature 332:323-327 (1988); Love ef al., Methods in 
Enzymology 178:515-527 (1989); Bindon etal., J. Exp. Med. 168:127-142 (1988). 

35 [0006] In the typical embodiment, such chimeric antibodies contain about one third rodent (or other non-human spe- 
cies) sequence and thus are capable of eliciting a significant anti-globulin response in humans. For example, in the case 
of the murine anti-CD3 antibody, OKT3, much of the resulting anti-globulin response is directed against the variable 
region rather than the constant region (Jaffers, G. J. ef al., Transplantation 41 :572-578 (1 986)). 
[0007] In a further effort to resolve the antigen binding functions of antibodies and to minimize the use of heterologous 

40 sequences in human antibodies, Winter and colleagues (Jones, P. T. ef al., Nature 321 :522-525 (1 986); Riechmann, L. 
ef al., Nature 332:323-327 (1988); Verhoeyen, M. etal., Science 239:1534-1536 (1988)) have substituted rodent CDRs 
or CDR sequences for the corresponding segments of a human antibody. As used herein, the term "humanized" anti- 
body is an embodiment of chimeric antibodies wherein substantially less than an intact human variable domain has 
been substituted by the corresponding sequence from a non-human species. In practice, humanized antibodies are typ- 

45 ically human antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

[0008] The therapeutic promise of this approach is supported by the clinical efficacy of a humanized antibody specific 
for the CAM PATH- 1 antigen with two non-Hodgkin lymphoma patients, one of whom had previously developed an anti- 
globulin response to the parental rat antibody (Riechmann, L. ef ai, Nature 332:323-327 (1988); Hale, G. ef ai, Lancet 

so 1:1394-1399 (1988)). A murine antibody to the interleukin 2 receptor has also recently been humanized (Queen, C. ef 
ai, Proc. Natl. Acad. Sci. USA 86:10029-10033 (1989)) as a potential immunosuppressive reagent. Additional refer- 
ences related to humanization of antibodies include Co etal., Proc. Natl. Acad. Sci. USA 88:2869-2873 (1991); Gor- 
man etal., Proc. Natl. Acad. Sci USA 88:4181-4185 (1991); Daugherty etal., Nucleic Acids Research 19(9):2471- 
2476 (1991); Brown ef ai, Proc. Natl Acad. Sci. USA 88:2663-2667 (1991); Junghans ef al, Cancer Research 

55 50:1495-1502 (1990). 

[0009] In some cases, substituting CDRs from rodent antibodies for the human CDRs in human frameworks is suffi- 
cient to transfer high antigen binding affinity (Jones, R T ef ai, Nature 321:522-525 (1986); Verhoeyen, M. ef ai, 
Science 239:1534-1536 (1988)), whereas in other cases it has been necessary to additionally replace one (Riech- 
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mann, L et aL, Nature 332:323-327 (1988)) or several (Queen, C. et aL, Proc. Natl. Acad. Sci. USA 86:10029-10033 

(1989) ) framework region (FR) residues. See also Co et al., supra 

[001 0] For a given antibody a small number of FR residues are anticipated to be important for antigen binding. Firstly 
for example, certain antibodies have been shown to contain a few FR residues which directly contact antigen in crystal 
5 structures of antibody-antigen complexes (e.g., reviewed in Davies, D. R. et al., Ann. Rev. Biochem. 59:439-473 

(1990) ). Secondly, a number of FR residues have been proposed by Chothia, Lesk and colleagues (Chothia, C. & Lesk, 
A. M., J. Mol. Biol. 196:901-917 (1987); Chothia, C. etaL, Nature 342:877-883 (1989); Tramontane A. et al., J. Mol. 
Biol. 215:175-182 (1990)) as critically affecting the conformation of particular CDRs and thus their contribution to anti- 
gen binding. See also Margolies et aL, Proc. Natl. Acad. Sci. USA 72:2180-2184 (1975). 

w [001 1 ] It is also known that, in a few instances, an antibody variable domain (either V H or may contain glycosyla- 
te sites, and that this glycosylation may improve or abolish antigen binding, Pluckthun, Biotechnology 9:545-51 

(1991) ; Spiegelberg et al., Biochemistry 9:4217-4223 (1970); Wallic etaL, J. Exp Med. 168:1099-1109 (1988); Sox et 
aL, Proc NatL Acad. Sci. USA 66:975-982 (1970); Margni et al., Ann. Rev. Immunol. 6:535-554 (1988). Ordinarily, how- 
ever, glycosylation has no influence on the antigen-binding properties of an antibody, Pluckthun, supra, (1991). 

is [0012] The three-dimensional structure of immunoglobulin chains has been studied, and crystal structures for intact 
immunoglobulins, for a variety of immunoglobulin fragments, and for antibody-antigen complexes have been published 
(see e.g., Saul et al., Journal of Biological Chemistry 25:585-97 (1978); Sheriff et. aL, Proc. NatL Acad. Sci. USA 
84:8075-79 (1987); Segal et al., Proc. NatL Acad. Sci. USA 71:4298-4302 (1974); Epp et aL, Biochemistry 
14(22):4943-4952 (1975); Marquart et aL, J. Mol. Biol. 141:369-391 (1980); Furey et aL, J. Mol. Biol. 167:661-692 

20 (1983); Snow and Amzel, Protein: Structure, Function, and Genetics 1:267-279, Alan R. Liss, Inc. pubs. (1986); Cho- 
thia and Lesk, J. Mol. Biol. 196:901-917 (1987); Chothia etaL, Nature 342:877-883 (1989); Chothia etaL, Science 
233:755-58 (1986); Huber etaL, Nature 264:415-420 (1976); Bruccoleri etaL, Nature 335:564-568 (1988) and Nature 
336:266 (1988); Sherman et al., Journal of Biological Chemistry 263:4064-4074 (1988); Amzel and Poljak, Ann. Rev. 
Biochem. 48:961-67 (1979); Silverton etaL, Proc. NatL Acad. Sci. USA 74:5140-5144 (1977); and Gregory etaL, 

25 Molecular Immunology 24:821 -829 (1 987). It is known that the function of an antibody is dependent on its three dimen- 
sional structure, and that amino acid substitutions can change the three-dimensional structure of an antibody, Snow 
and Amzel, supra. It has previously been shown that the antigen binding affinity of a humanized antibody can be 
increased by mutagenesis based upon molecular modelling (Riechmann, L. etaL, Nature 332:323-327 (1988); Queen, 
C. etaL, Proc. Natl. Acad. Sci. USA 86:10029-10033 (1989)). 

30 [0013] Humanizing an antibody with retention of high affinity for antigen and other desired biological activities is at 
present difficult to achieve using currently available procedures. Methods are needed for rationalizing the selection of 
sites for substitution in preparing such antibodies and thereby increasing the efficiency of antibody humanization. 
[0014] The proto-oncogene HER2 (human epidermal growth factor receptor 2) encodes a protein tyrosine kinase 
(p185 HER2 ) that is related to and somewhat homologous to the human epidermal growth factor receptor (see Cous- 

35 sens, L. et aL, Science 230:1132-1139 (1985); Yamamoto, T. et al., Nature 319:230-234 (1986); King, C. R. et aL, 
Science 229:974-976 (1985)). HER2 is also known in the field as c-erbB-2, and sometimes by the name of the rat 
homolog, neu. Amplification and/or overexpression of HER2 is associated with multiple human malignancies and 
appears to be integrally involved in progression of 25-30% of human breast and ovarian cancers (Slamon, D. J. etaL, 
Science 235:177-182 (1987), Slamon, D. J. etaL, Science 244:707-712 (1989)). Furthermore, the extent of amplifica- 

40 tion is inversely correlated with the observed median patient survival time (Slamon, supra, Science 1989). 

[0015] The murine monoclonal antibody known as muMAb4D5 (Fendly, B. M. et aL, Cancer Res. 50:1550-1558 
(1990)), directed against the extracellular domain (ECD) of p185 HER2 ( specifically inhibits the growth of tumor call lines 
overexpressing p185 HER2 in monolayer culture or in soft agar (Hudziak, R. M. et aL, Molec. Cell. Biol. 9:1165-1172 
(1989); Lupu, R. etaL, Science 249:1552-1555 (1990)). MuMAb4D5 also has the potential of enhancing tumor cell sen- 

45 sitivity to tumor necrosis factor, an important effector molecule in macrophage-mediated tumor cell cytotoxicity 
(Hudziak, supra, 1989; Shepard, H. M. and Lewis, G. D. J. Clinical Immunology 8:333-395 (1988)). Thus muMAb4D5 
has potential for clinical intervention in and imaging of carcinomas in which p185 HER2 is overexpressed. The 
muMAb4D5 and its uses are described in PCT- application WO 89/06692 published 27 July 1989. This murine antibody 
was deposited with the ATCC and designated ATCC CRL 10463. However, this antibody may be immunogenic in 

so humans. 

[0016] It is therefore an object of this invention to provide methods for the preparation of antibodies which are less 
antigenic in humans than non-human antibodies but have desired antigen binding and other characteristics and activi- 
ties. 

[001 7] It is a further object of this invention to provide methods for the efficient humanization of antibodies, i.e. select- 
55 ing non-human amino acid residues for importation into a human antibody background sequence in such a fashion as 
to retain or improve the affinity of the non-human donor antibody for a given antigen. 
[0018] It is another object of this invention to provide humanized antibodies capable of binding p185 HER2 . 
[0019] Other objects, features, and characteristics of the present invention will become apparent upon consideration 
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of the following description and the appended claims. 

Summary of the Invention 

s [0020] The objects of this invention are accomplished by a method for making a humanized antibody comprising 
amino acid sequence of an import, non-human antibody and a human antibody, comprising the steps of: 

a. obtaining the amino acid sequences of at least a portion of an import antibody variable domain and of a consen- 
sus variable domain; 

to b. identifying Complementarity Determining Region (CDR) amino acid sequences in the import and the human var- 
iable domain sequences; 

c. substituting an import CDR amino acid sequence for the corresponding human CDR amino acid sequence; 

d. aligning the amino acid sequences of a Framework Region (FR) of the import antibody and the corresponding 
FR of the consensus antibody; 

is e. identifying import antibody FR residues in the aligned FR sequences that are non-homologous to the corre- 
sponding consensus antibody residues; 

f. determining if the non-homologous import amino acid residue is reasonably expected to have at least one of the 
following effects: 

20 1 . non-covalently binds antigen directly, 

2. interacts with a CDR; or 

3. participates in the V L - V H interface; and 

g. for any non-homologous import antibody amino acid residue which is reasonably expected to have at least one 
25 of these effects, substituting that residue for the corresponding amino acid residue in the consensus antibody FR 

sequence. 

[0021] Optionally, the method of this invention comprises the additional steps of determining if any non-homologous 
residues identified in step (e) are exposed on the surface of the domain or buried within it, and if the residue is exposed 

30 but has none of the effects identified in step (f), retaining the consensus residue. 

[0022] Additionally, in certain embodiments the method of this invention comprises the feature wherein the corre- 
sponding consensus antibody residues identified in step (e) above are selected from the group consisting of 4L, 35L, 
36L, 38L, 43L, 44L, 46L, 58L, 62L, 63L, 64L, 65L, 66L, 67L, 68L, 69L, 70L, 71 L, 73L, 85L, 87L, 98L, 2H, 4H, 24H, 36H, 
37H, 39H, 43H, 45H, 49H, 58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 91 H, 92H, 93H, and 103H (uti- 

35 lizing the numbering system set forth in Kabat, E. A. et a/., Sequences of Proteins of Immunological Interest (National 
Institutes of Health, Bethesda, MD, 1987)). 

[0023] In certain embodiments, the method of this invention comprises the additional steps of searching either or both 
of the import, non-human and the consensus variable domain sequences for glycosylation sites, determining if the gly- 
cosylate is reasonably expected to be important for the desired antigen binding and biological activity of the antibody 

40 (i.e., determining if the glycosylation site binds to antigen or changes a side chain of an amino acid residue that binds 
to antigen, or if the glycosylation enhances or weakens antigen binding, or is important for maintaining antibody affinity). 
If the import sequence bears the glycosylation site, it is preferred to substitute that site for the corresponding residues 
in the consensus human rf the glycosylation site is reasonably expected to be important. If only the consensus 
sequence, and not the import, bears the glycosylation site, it is preferred to eliminate that glycosylation site or substitute 

45 therefore the corresponding amino acid residues from the import sequence. 

[0024] Another embodiment of this invention comprises aligning import antibody and the consensus antibody FR 
sequences, identifying import antibody FR residues which are non-homologous with the aligned consensus FR 
sequence, and for each such non-homologous import antibody FR residue, determining if the corresponding consen- 
sus antibody residue represents a residue which is highly conserved across all species at that site, and if it is so con- 

so served, preparing a humanized antibody which comprises the consensus antibody amino acid residue at that site. 
[0025] Certain alternate embodiments of the methods of this invention comprise obtaining the amino acid sequence 
of at least a portion of an import, non-human antibody variable domain having a CDR and a FR, obtaining the amino 
acid sequence of at least a portion of a consensus antibody variable domain having a CDR and a FR, substituting the 
non-human CDR for the human CDR in the consensus antibody variable domain, and then substituting an amino acid 

55 residue for the consensus amino acid residue at at least one of the following sites: 

a. (in the FR of the variable domain of the light chain) 4L, 35L, 36L, 38L, 43L, 44L, 58L, 46L, 62L, 63L, 64L, 85L, 
66L, 67L, 68L, 69L, 70L, 71 L, 73L, 85L, 
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87L, 98L, or b. (in the FR of the variable domain of the heavy chain) 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 
58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 91 H, 92H, 93H, and 103H. 

In preferred embodiments, the non-CDR residue substituted at the consensus FR site is the residue found at the corre- 

5 % sponding location of the non-human antibody. 

[0026] Optionally, this just-recited embodiment comprises the additional steps of following the method steps appear- 
ing at the beginning of this summary and determining whether a particular amino acid residue can reasonably be 
expected to have undesirable effects. 
. [0027] This invention also relates to a humanized antibody comprising the CDR sequence of an import, non-human 

w antibody and the FR sequence of a human antibody, wherein an amino acid residue within the human FR sequence 
located at any one of the sites 4L, 35L, 36L, 38L, 43L, 44L, 46L, 58L, 62L, 63L, 64L, 65L, 66L, 67L, 68L, 69L, 70L, 71 L, 
73L, 85L, 87L, 98L, 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 
78H, 91 H, 92H, 93H, and 103H has been substituted by another residue. In preferred embodiments, the residue sub- 
stituted at the human FR site is the residue found at the corresponding location of the non-human antibody from which 

is the non-human CDR was obtained. In other embodiments, no human FR residue other than those set forth in this group 
has been substituted. 

[0028] This invention also encompasses specific humanized antibody variable domains, and isolated polypeptides 
having homology with the following sequences. 

20 1 . SEQ. ID NO. 1 , which is the light chain variable domain of a humanized version of 

muMAb4D5: 

DIQMTQSPSSLSASVGDR\rnTCRASQDVNTAVAWYOQKPGKAPKLLIYSASFLESGVP 
SRFSGSRSGTOFTLTISSLQPEDFATYYCQQHYTTPPTFGQGTKVEIKRT 
2. SEQ. ID NO. 2, which is the heavy chain variable domain of a humanized version 
of muMAb4D5): 

EVQLVESGGGLVQPGGSU^LSCAASGFNIKDTYIHVVVRQAPGKGLEWVARIYPTNG^ 
YADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMDVWGQGTL 

VTVSS 
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[0029] In another aspect, this invention provides a consensus antibody variable domain amino acid sequence for use 
in the preparation of humanized antibodies, methods for obtaining, using, and storing a computer representation of 
such a consensus sequence, and computers comprising the sequence data of such a sequence. In one embodiment, 
the following consensus antibody variable domain amino acid sequences are provided: 

SEQ. ID NO. 3 (light chain): 

DIQMTQSPSSLSASVGDR\n"ITCRASQDVSSYLAmQQKPGKAPKLLIYAASSLESGVP 
SRFSGSGSGTDFTLTISSLOPEDFATYYCQQYNSLPYTFGQGTKVEIKRT, and 
SEQ. ID NO. 4 (heavy chain): 

EVQLVESGGGLVQPGGSl^lSCAASGFTFSDYAMSWVRQAPGKGLEWVAVISENGGYT 
RYADSVKGRFTISADTSKNTAYLQMNSU^AEDTANTr^CSRWGGDGFYAMDVWGQGTL 

VTVSS 



55 



5 



■l 



EP 0 940 468 A1 

Brief Description of the Drawings 
[0030] 

5 FIGURE 1 A shows the comparison of the V L domain amino acid residues of muMAb4D5, huMAb4D5, and a con- 

sensus sequence (Fig. 1A, SEQ.ID NO. 5, SEQ. ID NO. 1 and SEQ. ID NO. 3, respectively). FIGURE 1B shows 
the comparison between the V H domain amino acid residues of the muMAb4d5, huMAb4D5, and a consensus 
sequence (Fig. 1B, SEQ. ID NO. 6, SEQ. ID NO. 2 and SEQ. ID NO. 4, respectively). Both Figs 1Aand 1B use the 
generally accepted numbering scheme from Kabat, E. A., et al., Sequences of Proteins of Immunologicol Interest 

10 (National Institutes of Health, Bethesda, MD (1987)). In both Fig. 1A and Fig. 1B, the CDR residues determined 
according to a standard sequence definition (as in Kabat, E. A. et al., Sequences of Proteins of Immunologicol 
Interest (National Institutes of Health, Bethesda, MD, 1987)) are indicated by the first underlining beneath me 
sequences, and the CDR residues determined according to a structural definition (as in Chothia, C. & Lesk, A. M., 
J. Mol. Biol. 196:901-917 (1987)) are indicated by the second, lower underlines. The mismatches between genes 

is are shown by the vertical lines. 

FIGURE 2 shows a scheme for humanization of muMAb4D5 V L and V H by gene conversion mutagenesis. 
FIGURE 3 shows the inhibition of SK-BR-3 proliferation by MAb4D5 variants. Relative cell proliferation was deter- 
mined as described (Hudziak, R. M. et al., Molec. Cell. Biol 9:1165-1172 (1989)) and data (average of triplicate 
determinations) are presented as a percentage of results with untreated cultures for muMAbADS (I), huMAb4D5-8 

20 (n) and huMAb4D5-1 (I). 

FIGURE 4 shows a stereo view of a-carbon tracing for a model of huMAb4D5-8 V L and V H . The CDR residues 
(Kabat, E. A. et al., Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD, 
1987)) are shown in bold and side chains of V H residues A71, T73, A78, S93, Y102 and V L residues Y55 plus R66 
(see Table 3) are shown. 

25 FIGURE 5 shows an amino acid sequence comparison of V L (top panel) and V H (lower panel) domains of the 
murine anti-CD3 monoclonal Ab UCHT1 (muxCD3, Shalaby etai, J. Exp. Med 175, 21 7-225(1992) with a human- 
ized variant of this antibody (huxCD3v9). Also shown are consensus sequences (most commonly occurring residue 
or pair of residues) of the most abundant human subgroups, namely V l k1 and V H III upon which the humanized 
sequences are based (Kabat, E. A. et al., Sequences of Proteins of Immunological Interest, 5 th edition, National 

30 Institutes of Health, Bethesda, MD, USA (1 991)). The light chain sequences-muxCD3, huxCD3v9 and huxl-corre- 
spond to SEQ.ID.NOs 16, 17, and 18, respectively. The heavy chain sequences-muxCD3, huxCD3v9 and huicl-- 
correspond to SEQ.ID.NOs 19, 20, and 21, respectively. Residues which differ between muxCD3 and huxCD3v9 
are identified by an asterisk (*), whereas those which differ between humanized and consensus sequences are 
identified by a sharp sign (#). A bullet (°) denotes that a residue at this position has been found to contact antigen 

35 in one or more crystallographic structures of antibody/antigen complexes (Kabat et al., 1991 ; Mian, I. S. et al., J. 
Mol. Biol. 217, 133-151 (1991)). The location of CDR residues according to a sequence definition (Kabat et al., 
1991) and a structural definition (Chothia and Lesk, supra 1987) are shown by a line and carats ( A ) beneath the 
sequences, respectively. 

FIGURE 6A compares murine and humanized amino acid sequences for the heavy chain of an anti-CD18 antibody. 
40 H52H4-160 (SEQ. ID. NO. 22) is the murine sequence, and pH52-8.0 (SEQ. ID. NO. 23) is the humanized heavy 
chain sequence. pH52-8.0 residue 143S is the final amino acid in the variable heavy chain domain V H , and residue 
144A is the first amino acid in the constant heavy chain domain C H i . 

FIGURE 6B compares murine and humanized amino acid sequences for the light chain of an anti-CD18 antibody. 
H52L6-158 (SEQ. ID. NO. 24) is the murine sequence, and pH52-9.0 (SEQ. ID. NO. 25) is the humanized light 
45 chain sequence. pH52-9.0 residue 128T is the final amino acid in the light chain variable domain V L , and residue 
129V is the f irst amino acid in the light chain constant domain C L . 

Detailed Description of the Invention 

so Definitions 

[0031] In general, the following words or phrases have the indicated definitions when used in the description, exam- 
ples, and claims: 

[0032] The murine monoclonal antibody known as muMAb4D5 (Fendly, B. M. et al., Cancer Res. 50:1550- 
55 1558(1990)) is directed against the extracellular domain (ECD) of p185 HER2 . The muMAb4D5 and its uses are 
described in PCT application WO 89/06692 published 27 July 1 989. This murine antibody was deposited with the ATCC 
and designated ATCC CRL 10463. In this description and claims, the terms muMAb4D5, chMAb4D5 and huMAb4D5 
represent murine, chimerized and humanized versions of the monoclonal antibody 4D5, respectively. 
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[0033] A humanized antibody for the purposes herein is an immunoglobulin amino acid sequence variant or fragment 
thereof which is capable of binding to a predetermined antigen and which comprises a FR region having substantially 
the amino acid sequence of a human immunoglobulin and a CDR having substantially the amino acid sequence of a 
non-human immunoglobulin. 

s [0034] Generally, a humanized antibody has one or more amino acid residues introduced into it from a source which 
is non-human. These non-human amino acid residues are referred to herein as "import" residues, which are typically 
taken from an "import" antibody domain, particularly a variable domain. An import residue, sequence, or antibody has 
a desired affinity and/or specificity, or other desirable antibody biological activity as discussed herein. 
[0035] In general, the humanized antibody will comprise substantially all of at least one, and typically two, variable 

w domains (Fab, Fab', F(ab*) 2 . Fabc, Fv) in which all or substantially all of the CDR regions correspond to those of a non- 
human immunoglobulin and all or substantially all of the FR regions are those of a human immunoglobulin consensus 
sequence. The humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region 
(Fc), typically that of a human immunoglobulin. Ordinarily, the antibody will contain both the. light chain as well as at 
least the variable domain of a heavy chain. The antibody also may include the CH1 , hinge, CH2, CH3, and CH4 regions 

15 of the heavy chain. 

[0036] The humanized antibody will be selected from any class of immunoglobulins, including IgM, IgG, IgD, IgA and 
IgE, and any isotype, including lgG1, lgG2, lgG3 and lgG4. Usually the constant domain is a complement fixing con- 
stant domain where it is desired that the humanized antibody exhibit cytotoxic activity, and the class is typically IgG^ 
Where such cytotoxic activity is not desirable, the constant domain may be of the lgG 2 class. The humanized antibody 
20 may comprise sequences from more than one class or isotype, and selecting particular constant domains to optimize 
desired effector functions is within the ordinary skill in the art. 

[0037] The FR and CDR regions of the humanized antibody need not correspond precisely to the parental sequences, 
e.g., the import CDR or the consensus FR may be mutagenized by substitution, insertion or deletion of at least one res- 
idue so that the CDR or FR residue at that site does not correspond to either the consensus or the import antibody 

25 Such mutations, however, will not be extensive. Usually, at least 75% of the humanized antibody residues will corre- 
spond to those of the parental FR and CDR sequences, more often 90%, and most preferably greater than 95%. 
[0038] In general, humanized antibodies prepared by the method of this invention are produced by a process of anal- 
ysis of the parental sequences and various conceptual humanized products using three dimensional models of the 
parental and humanized sequences. Three dimensional immunoglobulin models are commonly available and are famil- 

30 iar to those skilled in the art. Computer programs are available which illustrate and display probable three dimensional 
conformational structures of selected candidate immunoglobulin sequences. Inspection of these displays permits anal- 
ysis of the likely role of the residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of 
residues that influence the ability of the candidate immunoglobulin to bind its antigen. 

[0039] Residues that influence antigen binding are defined to be residues that are substantially responsible for the 
35 antigen affinity or antigen specificity of a candidate immunoglobulin, in a positive or a negative sense. The invention is 
directed to the selection and combination of FR residues from the consensus and import sequence so that the desired 
immunoglobulin characteristic is achieved. Such desired characteristics include increases in affinity and greater specif- 
icity for the target antigen, although it is conceivable that in some circumstances the opposite effects might be desired. 
In general, the CDR residues are directly and most substantially involved in influencing antigen binding (although not 
40 all CDR residues are so involved and therefore need not be substituted into the consensus sequence). However, FR 
residues also have a significant effect and can exert their influence in at least three ways: They may noncovalently 
directly bind to antigen, they may interact with CDR residues and they may affect the interface between the heavy and 
light chains. 

[0040] A residue that noncovalently directly binds to antigen is one that, by three dimensional analysis, is reasonably 
45 expected to noncovalently directly bind to antigen. Typically, it is necessary to impute the position of antigen from the 
spatial location of neighboring CDRs and the dimensions and structure of the target antigen. In general, only those 
humanized antibody residues that are capable of forming salt bridges, hydrogen bonds, or hydrophobic interactions are 
likely to be involved in non-covalent antigen binding, however residues which have atoms which are separated from 
antigen spatially by 3.2 Angstroms or less may also non-covalently interact with antigen. Such residues typically are the 
so relatively larger amino acids having the side chains with the greatest bulk, such as tyrosine, arginine, and lysine. Anti- 
gen-binding FR residues also typically will have side chains that are oriented into an envelope surrounding the solvent 
oriented face of a CDR which extends about 7 Angstroms into the solvent from the CDR domain and about 7 Angstroms 
on either side of the CDR domain, again as visualized by three dimensional modeling. 

[0041] A residue that interacts with a CDR generally is a residue that either affects the conformation of the CDR 
55 polypeptide backbone or forms a noncovalent bond with a CDR residue side chain. Conformation-affecting residues 
ordinarily are those that change the spatial position of any CDR backbone atom (N, Ca, C, O, Cp) by more than about 
0.2 Angstroms. Backbone atoms of CDR sequences are displaced for example by residues that interrupt or modify 
organized structures such as beta sheets, helices or loops. Residues that can exert a profound affect on the confer ma- 
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tion of neighboring sequences include proline and glycine, both of which are capable of introducing bends into the back- 
bone. Other residues that can displace backbone atoms are those that are capable of participating in salt bridges and 
hydrogen bonds. 

[0042] A residue that interacts with a CDR side chain is one that is reasonably expected to form a noncovalent bond 

5 with a CDR side chain, generally either a salt bridge or hydrogen bond. Such residues are identified by three dimen- 
sional positioning of their side chains. A salt or ion bridge could be expected to form between two side chains positioned 
within about 2.5 - 3.2 Angstroms of one another that bear opposite charges, for example a lysinyl and a glutamyl pairing. 
A hydrogen bond could be expected to form between the side chains of residue pairs such as seryl or threonyl with 
aspartyl or glutamyl (or other hydrogen accepting residues). Such pairings are well known in the protein chemistry art 

10 and will be apparent to the artisan upon three dimensional modeling of the candidate immunoglobulin. 

[0043] Immunoglobulin residues that affect the interface between heavy and light chain variable regions ("the V L - V H 
interface") are those that affect the proximity or orientation of the two chains with respect to one another. Certain resi- 
dues involved in interchain interactions are already known and include V L residues 34, 36, 38, 44, 46, 87, 89, 91 , 96, 
and 98 and V H residues 35, 37, 39, 45, 47, 91 , 93, 95, 100, and 103 (utilizing the nomenclature set forth in Kabat et al., 

15 Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD, 1987)). Additional resi- 
dues are newly identified by the inventors herein, and include 43L, 85L, 43H and 60H. While these residues are indi- 
cated for IgG only, they are applicable across species. In the practice of this invention, import antibody residues that are 
reasonably expected to be involved in interchain interactions are selected for substitution into the consensus sequence. 
It is believed that heretofore no humanized antibody has been prepared with an intrachain-affecting residue selected 

20 from an import antibody sequence. 

[0044] Since it is not entirely possible to predict in advance what the exact impact of a given substitution will be it may 
be necessary to make the substitution and assay the candidate antibody for the desired characteristic. These steps, 
however, are per se routine and well within the ordinary skill of the art. 

[0045] CDR and FR residues are determined according to a standard sequence definition (Kabat et al., Sequences 

25 of Proteins of Immunological Interest, National Institutes of Health, Bethesda MD (1987), and a structural definition (as 
in Chothia and Lesk, J. Mol. Biol. 196:901-917 (1987). Where these two methods result in slightly different identifica- 
tions of a CDR, the structural definition is preferred, but the residues identified by the sequence definition method are 
considered important FR residues for determination of which framework residues to import into a consensus sequence. 
[0046] Throughout this description, reference is made to the numbering scheme from Kabat, E. A., et al., Sequences 

30 of Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD (1987) and (1991). In these compen- 
diums, Kabat lists many amino acid sequences for antibodies for each subclass, and lists the most commonly occurring 
amino acid for each residue position in that subclass. Kabat uses a method for assigning a residue number to each 
amino acid in a listed sequence, and this method for assigning residue numbers has become standard in the field. The 
Kabat numbering scheme is followed in this description. For purposes of this invention, to assign residue numbers to a 

35 candidate antibody amino acid sequence which is not included in the Kabat compendium, one follows the following 
steps. Generally, the candidate sequence is aligned with any immunoglobulin sequence or any consensus sequence in 
Kabat. Alignment may be done by hand, or by computer using commonly accepted computer programs; an example of 
such a program is the Align 2 program discussed in this description. Alignment may be facilitated by using some amino 
acid residues which are common to most Fab sequences. For example, the light and heavy chains each typically have 

40 two cysteines which have the same residue numbers; in V L domain the two cysteines are typically at residue numbers 
23 and 88, and in the V H domain the two cysteine residues are typically numbered 22 and 92. Framework residues gen- 
erally, but not always, have approximately the same number of residues, however the CDRs will vary in size. For exam- 
pie, in the case of a CDR from a candidate sequence which is longer than the CDR in the sequence in Kabat to which 
it is aligned, typically suffixes are added to the residue number to indicate the insertion of additional residues (see, e.g. 

45 residues 1 0Oabcde in fig. 5). For candidate sequences which, for example, align with a Kabat sequence for residues 34 
and 36 but have no residue between them to align with residue 35, the number 35 is simply not assigned to a residue. 
[0047] Thus, in humanization of an import variable sequence, where one cuts out an entire human or consensus CDR 
and replaces it with an import CDR sequence, (a) the exact number of residues may be swapped, leaving the number- 
ing the same, (b) fewer import amino acid residues may be introduced than are cut, in which case there will be a gap in 

so the residue numbers, or (c) a larger number of amino acid residues may be introduced then were cut, in which case the 
numbering will involve the use of suffixes such as 100abcde. 

[0048] The terms "consensus sequence" and "consensus antibody" as used herein refers to an amino acid sequence 
which comprises the most frequently occurring amino acid residues at each location in all immunoglobulins of any par- 
ticular subclass or subunit structure. The consensus sequence may be based on immunoglobulins of a particular spe- 
55 cies or of many species. A "consensus" sequence, structure, or antibody is understood to encompass a consensus 
human sequence as described in certain embodiments of this invention, and to refer to an amino acid sequence which 
comprises the most frequently occurring amino acid residues at each location in all human immunoglobulins of any par- 
ticular subclass or subunit structure. This invention provides consensus human structures and consensus structures 
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which consider other species in addition to human. 

[0049] The subunit structures of the five immunoglobulin classes in humans are as follows: 



10 
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[0050] In preferred embodiments of an IgGyl human consensus sequence, the consensus variable domain 
sequences are derived from the most abundant subclasses in the sequence compilation of Kabat et al., Sequences of 
Proteins of Immunological Interest, National Institutes of Health, Bethesda MD (1987), namely V L k subgroup I and V H 
group ill. In such preferred embodiments, the V L consensus domain has the amino acid sequence: 



25 



DIQMTaSPSSLSASVGDRVTrrCRASQDVSSYLA^ 

SGSGTDFTLTISSLOPEDFATYYCQQYNSLPYTFGQGTKVEIKRT (SEQ. ID NO. 3>; 



the V H consensus domain has the amino acid sequence: 

30 EVQLVESGGGLVQPGGSLRLSCAASGFTFSDYAMSWVRQAPGKGLEWVAV1SENGGYTRYAD 
SVKGRFTISAOTSK^AYLQMNSLI^EDTA\nfTCSRWGGDGFYAMD\^ 
ID NO. 4). 

35 

These sequences include consensus CDRs as well as consensus FR residues (see for example in Fig. 1). 
[0051 ] While not wishing to be limited to any particular theories, it may be that these preferred embodiments are less 
likely to be immunogenic in an individual than less abundant subclasses. However, in other embodiments, the consen- 
sus sequence is derived from other subclasses of human immunoglobulin variable domains. In yet other embodiments, 

40 the consensus sequence is derived from human constant domains. 

[0052] Identity or homology with respect to a specified amino acid sequence of this invention is defined herein as the 
percentage of amino acid residues in a candidate sequence that are identical with the specified residues, after aligning 
the sequences and introducing gaps, if necessary, to achieve the maximum percent homology, and not considering any 
conservative substitutions as part of the sequence identity. None of N-terminal, C-terminal or internal extensions, dele- 

45 tions, or insertions into the specified sequence shall be construed as affecting homology. All sequence alignments 
called for in this invention are such maximal homology alignments. While such alignments may be done by hand using 
conventional methods, a suitable computer program is the "Align 2" program for which protection is being sought from 
the U.S. Register of Copyrights (Align 2, by Genentech, Inc., application filed 9 December 1991). 
[0053] "Non-homologous" import antibody residues are those residues which are not identical to the amino acid res- 

50 idue at the analogous or corresponding location in a consensus sequence, after the import and consensus sequences 
are aligned. 

[0054] The term "computer representation" refers to information which is in a form that can be manipulated by a com- 
puter. The act of storing a computer representation refers to the act of placing the information in a form suitable for 
manipulation by a computer. 

55 [0055] This invention is also directed to novel polypeptides, and in certain aspects, isolated novel humanized anti- 
p185 HER2 antibodies are provided. These novel anti-p185 HER2 antibodies are sometimes collectively referred to herein 
as huMAb4D5, and also sometimes as the light or heavy chain variable domains of huMAb4D5, and are defined herein 
to be any polypeptide sequence which possesses a biological property of a polypeptide comprising the following 
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polypeptide sequence: 

DIQMTQSPSSLSASVGDRVmCRASQDVNTAVAWYQQKPGKAPKLLIYSASFLESGVP 
SRFSGSRSGTDFTLTISSLOPEDFATYYCQQHYTTPPTFGQGTKVEIKRT (SEQ. ID NO. 1, 
which is the light chain variable domain of huMAMD5); or 
EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHVWRQAP 
YADSVKGRFTISADTSKfn'AYLQMNSlBAEDTAVYYCSRWGGDGFYAMDVW 
VTVSS (SEQ. ID NO. 2, which b the heavy chain variable domain of huMAb4D5). 



is [0056] "Biological property", as relates for example to artti-p1 85 HER2 , for the purposes herein means an in vivo effec- 
tor or antigen-binding function or activity that is directly or indirectly performed by huMAb4D5 (whether in its native or 
denatured conformation). Effector functions include p185 HER2 binding, any hormonal or hormonal antagonist activity, 
any mitogenic or agonist or antagonist activity, any cytotoxic activity. An antigenic function means possession of an 
epitope or antigenic site that is capable of cross-reacting with antibodies raised against the polypeptide sequence of 

20 huMAb4D5. 

[0057] Biologically active huMAb4D5 is defined herein as a polypeptide that shares an effector function of huMAb4D5. 
A principal known effector function of huMAb4D5 is its ability to bind to p185 HER2 . 

[0058] Thus, the biologically active and arttigenically active huMAb4D5 polypeptides that are the subject of certain 
embodiments of this invention include the sequence of the entire translated nucleotide sequence of huMAb4D5; mature 

25 huMAb4D5; fragments thereof having a consecutive sequence of at least 5, 10, 15, 20, 25, 30 or 40 amino acid resi- 
dues comprising sequences from muMAb4D5 plus residues from the human FR of huMAb4D5; amino acid sequence 
variants of huMAb4D5 wherein an amino acid residue has been inserted N- or C-terminal to, or within, huMAb4D5 or 
its fragment as defined above; amino acid sequence variants of huMAb4D5 or its fragment as defined above wherein 
an amino acid residue of huM AMDS or its fragment as defined above has been substituted by another residue, inctud- 

30 ing predetermined mutations by, e.g., site-directed or PCR mutagenesis; derivatives of huMAb4D5 or its fragments as 
defined above wherein huMAb4D5 or its fragments have been covalent modified, by substitution, chemical, enzymatic, 
or other appropriate means, with a moiety other than a naturally occurring amino acid; and glycosylation variants of 
huM Ab4D5 (insertion of a glycosylation site or deletion of any glycosylation site by deletion, insertion or substitution of 
suitable residues). Such fragments and variants exclude any polypeptide heretofore identified, including muMAb4D5 or 

35 any known polypeptide fragment, which are anticipatory order 35 U.S.C.1 02 as well as polypeptides obvious thereover 
under 35 U.S.C. 103. 

[0059] An "isolated" polypeptide means polypeptide which has been identified and separated and/or recovered from 
a component of its natural environment. Contaminant components of its natural environment are materials which would 
interfere with diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other pro- 

40 teinaceous or nonproteinaceous solutes. In preferred embodiments, for example, a polypeptide product comprising 
huMAb4D5 will be purified from a all culture or other synthetic environment (1) to greater than 95% by weight of protein 
as determined by the Lowry method, and most preferably more than 99% by weight, (2) to a degree sufficient to obtain 
at least 15 residues of N-terminal or internal amino acid sequence by use of a gas- or liquid-phase sequenator (such 
as a commercially available Applied Biosystems sequenator Model 470, 477, or 473), or (3) to homogeneity by SDS- 

45 PAGE under reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated huMAb4D5 
includes huMAb4D5 in silu within recombinant cells since at least one component of the huMAb4D5 natural environ- 
ment will not be present. Ordinarily, however, isolated huMAb4D5 will be prepared by at least one purification step. 
[0060] In accordance with this invention, huMAb4D5 nucleic acid is RNA or DNA containing greater than ten bases 
that encodes a biologically or antigenically active huMAb4D5, is complementary to nucleic acid sequence encoding 

so such huMAb4D5, or hybridizes to nucleic acid sequence encoding such huMAb4D5 and remains stably bound to it 
under stringent conditions, and comprises nucleic acid from a muMAb4D5 CDR and a human FR region. 
[0061] Preferably, the huMAb4D5 nucleic acid encodes a polypeptide sharing at least 75% sequence identity, more 
preferably at least 80%, still more preferably at least 85%, even more preferably at 90%, and most preferably 95%, with 
the huM AMDS amino acid sequence. Preferably, a nucleic acid molecule that hybridizes to the huMAb4D5 nucleic acid 

55 contains at least 20, more preferably 40, and most preferably 90 bases. Such hybridizing or complementary nucleic 
acid, however, is further defined as being novel under 35 U.S.C. 102 and unobvious under 35 U.S.C. 103 over any prior 
art nucleic acid. 

[0062] Stringent conditions are those that (1) employ low ionic strength and high temperature for washing, for exam- 
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pie, 0.015 M NaCI/0.0015 M sodium citrate/0/1 % NaDodS0 4 at 50° C; (2) employ during hybridization a denaturing 
agent such as formamide, for example, 50% (vol/vol) formamide with 0.1% bovine serum albumin/0/1 % Ficoll/0/1 % 
polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCI, 75 mM sodium citrate at 42° C; or 
(3) employ 50% formamide, 5 x SSC (0.75 M NaCI, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% 
5 sodium pyrophosphate, 5 x Denhardt s solution, sonicated salmon sperm DNA (50 g/ml), 0.1 % SDS, and 10% dextran 
sulfate at 42 C, with washes at 42 C in 0.2 x SSC and 0.1% SDS. 

[0063] The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked 
coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, 
include a promoter, optionally an operator sequence, a ribosome binding site, and possibly, other as yet poorly under- 

10 stood sequences. Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers. 

[0064] Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid 
sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is 
expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably 
linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked 

is to a coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" means that the DNA 
sequences being linked are contiguous and, in the case of a secretory leader, contiguous and in reading phase. How- 
ever enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such 
sites do not exist, then synthetic oligonucleotide adaptors or linkers are used in accord with conventional practice. 
[0065] An "exogenous" element is defined herein to mean nucleic acid sequence that is foreign to the cell, or homol- 

20 ogous to the cell but in a position within the host cell nucleic acid in which the element is ordinarily not found. 

[0066] As used herein, the expressions "cell," "cell line," and "cell culture" are used interchangeably and all such des- 
ignations include progeny. Thus, the words "transformants" and "transformed cells" include the primary subject cell and 
cultures derived therefrom without regard for the number of transfers. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the same func- 

25 tion or biological activity as screened for in the originally transformed cell are included. Where distinct designations are 
intended, it will be clear from the context. 

[0067] "Oligonucleotides" are short-length, single- or double-stranded polydeoxynucleotides that are chemically syn- 
thesized by known methods (such as phosphotriester, phosphite, or phosphoramidite chemistry, using solid phase tech- 
niques such as described in EP 266,032 published 4 May 1988, or via deoxynucleoside H-phosphonate intermediates 

30 as described by Froehler et a/., Nucl. Acids Res.. 14; 5399-5407 [1 986]). They are then purified on polyacrylamide gels. 
[0068] The technique of "polymerase chain reaction," or "PCR," as used herein generally refers to a procedure 
wherein minute amounts of a specific piece of nucleic acid, RNA and/or DNA, are amplified as described in U.S. Pat. 
No. 4,683,195 issued 28 July 1987. Generally, sequence information from the ends of the region of interest or beyond 
needs to be available, such that oligonucleotide primers can be designed: these primers will be identical or similar in 

35 sequence to opposite strands of the template to be amplified. The 5' terminal nucleotides of the two primers may coin- 
cide with the ends of the amplified material. PCR can be used to amplify specific RNA sequences, specific DNA 
sequences from total genomic DNA, and cDNA transcribed from total cellular RNA, bacteriophage or plasmid 
sequences, etc. See generally Mullis et al., Cold Spring Harbor Svmo. Quant. Biol.. 51: 263 (1987); Erlich, ed.. PCR 
Technology. (Stockton Press, NY, 1989). As used herein, PCR is considered to be one. but not the only, example of a 

40 nucleic acid polymerase reaction method for amplifying a nucleic acid test sample, comprising the use of a known 
nucleic acid (DNA or RNA) as a primer and utilizes a nucleic acid polymerase to amplify or generate a specific piece of 
nucleic acid or to amplify or generate a specific piece of nucleic acid which is complementary to a particular nucleic 
acid. 

45 Suitable Meth ods for Practicing the Invention 

[0069] Some aspects of this invention include obtaining an import, non-human antibody variable domain, producing 
a desired humanized antibody sequence and for humanizing an antibody gene sequence are described below. A par- 
ticularly preferred method of changing a gene sequence, such as gene conversion from a non-human or consensus 
so sequence into a humanized nucleic acid sequence, is the cassette mutagenesis procedure described in Example 1. 
Additionally, methods are given for obtaining and producing antibodies generally, which apply equally to native non- 
human antibodies as well as to humanized antibodies. 

[0070] Generally, the antibodies and antibody variable domains of this invention are conventionally prepared in recom- 
binant cell culture, as described in more detail below. Recombinant synthesis is preferred for reasons of safety and 
55 economy, but it is known to prepare peptides by chemical synthesis and to purify them from natural sources; such prep- 
arations are included within the definition of antibodies herein. 
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Molecular Modeling 

[0071 ] An integral step in our approach to antibody humanization is construction of computer graphics models of the 
import and humanized antibodies. These models are used to determine if the six complementarity-determining regions 

5 (CDRs) can be successfully transplanted from the import framework to a human one and to determine which framework 
residues from the import antibody, if any, need to be incorporated into the humanized antibody in order to maintain CDR 
conformation. In addition, analysis of the sequences of the import and humanized antibodies and reference to the mod- 
els can help to discern which framework residues are unusual and thereby might be involved in antigen binding or main- 
tenance of proper antibody structure. 

io [0072] All of the humanized antibody models of this invention are based on a single three-dimensional computer 
graphics structure hereafter referred to as the consensus structure. This consensus structure is a key distinction from 
the approach of previous workers in the field, who typically begin by selecting a human antibody structure which has an 
amino acid sequence which is similar to the sequence of their import antibody. 

[0073] The consensus structure of one embodiment of this invention was built in five steps as described below. 
is [0074] Step 1 : Sever Fab X-ray crystal structures from the Brookhaven Protein Data Bank were used (entries 2FB4, 
2RHE, 3 FAB, and 1REI which are human structures, and 2MCP, 1FBJ, and 2HFL which are murine structures). For 
each structure, protein mainchain geometry and hydrogen bonding patterns were used to assign each residue to one 
of three secondary structure types: alpha-helix, beta-strand or other (i.e. non-helix and non-strand). The immunoglob- 
ulin residues used in superpositioning and those included in the consensus structure are shown in Table 1 . 
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Table 1 
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Immunoglobulin Residues Used in Superpositioning and Those Included in the Consensus Struc- 
ture 
V l k domain 






2FB4 


2RHE 


2MCP 


3 FAB 


1FBJ 


2HFL 


1REI 


Consensus b 




















2-11 


10 




18-24 


18-24 


19-25 


18-24 


19-25 


19-25 


19-25 


16-27 






32-37 


34-39 


39-44 


32-37 


32-37 


32-37 


33-38 


33-39 
41-49 


15 




60-66 


62-68 


67-72 


53-66 


60-65 


60-65 


61-66 


59-77 






69-74 


71-76 


76-81 


69-74 


69-74 


69-74 


70-75 








84-88 


86-90 


91-95 


84-88 


84-88 


84-88 


85-89 


82-91 


20 


RMS C 




0.40 


0.60 


0.53 


0.54 


0.48 


0.50 




V H domain 






2FB4 


2MCP 


3 FAB 


1FBJ 


2HFL 


Consensus 13 
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18-25 


18-25 


18-25 


18-25 


17-23 










34-39 


34-39 


34-39 


34-39 


34-39 


33-41 










46-52 


46-52 


46-52 


46-52 


46-52 


45-51 










57-61 


59-63 


56-60 


57-61 


57-61 


57-61 






30 




68-71 


70-73 


67-70 


68-71 


68-71 


66-71 










78-84 


80-86 


77-83 


78-84 


78-84 


75-82 










92-99 


94-101 


91-98 


92-99 


92-99 


88-94 






35 


RMS C 
RMS d 


0.91 


0.43 
0.73 


0.85 
0.77 


0.62 
0.92 


0.91 


102-108 







a Four- letter code for Protein Data Bank file. 

b Residue numbers tor the crystal structures are taken from the Protein Data Bank files. Residue numbers 

for the consensus structure are according to Kabat et al. 

c Root- mean-square deviation in A tor (N.Ca.C) atoms superimposed on 2FB4. 

d Root-mean-square deviation in A for (N.Ca.C) atoms superimposed on 2HFL 



45 

[0075J Step 2: Haying identified the alpha-helices and beta-strands in each of the seven structures, the structures 
were superimposed on one another using the INSIGHT computer program (Biosym Technologies, San Diego, CA) as 
follows: The 2FB4 structure was arbitrarily chosen as the template (or reference) structure. The 2FB4 was held fixed in 
space and the other six structures rotated and translated in space so that their common secondary structural elements 
so (i.e. alpha-helices and beta-strands) were oriented such that these common elements were as close in position to one 
another as possible. (This superimposeing was performed using accepted mathematical formulae rather than actually 
physically moving the structures by hand.) 

[0076] Step 3: With the seven structures thus superimposed, for each residue in the template (2FB4) Fab one calcu- 
lates the distance from the template alpha-carbon atom (Ca) to the analogous Ca atom in each of the other six super- 
55 imposed structures. This results in a table of Ca-Ca distances for each residue position in the sequence. Such a table 
is necessary in order to determine which residue positions will be included in the consensus model. Generally, if all Ca- 
Ca distances for a given residue position were £ 1 .OA, that position was included in the consensus structure. If for a 
given position only one Fab crystal structure was > 1 .OA, the position was included but the outlying crystal structure was 
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not included in the next step (for this position only). In general, the seven p-strands were included in the consensus 
structure while some of the loops connecting the p-strands, e.g. complementarity-determining regions (CDRs), were 
not included in view of Ca divergence. 

[0077] Step 4: For each residue which was included in the consensus structure after step 3, the average of the coor- 
5 dinates for individual mainchain N, Ca, C, O and Cp atoms were calculated. Due to the averaging procedure, as well as 
variation in bond length, bond angle and dihedral angle among the crystal structures, this "average" structure contained 
some bond lengths and angles which deviated from standard geometry. For purposes of this invention, "standard geom- 
etry" is understood to include geometries commonly accepted as typical, such as the compilation of bond lengths and 
angles from small molecule structures in Weiner, S.J. et al., J. Amer. Chem. Soc, 106: 765-784 (1984). 
10 [0078] Step 5: In order to correct these deviations, the final step was to subject the "average" structure to 50 cycles 
of energy minimization (DISCOVER program, Biosym Technologies) using the AMBER (Weiner, S.J. et. al., J. Amer. 
Chem. Soc., 1 06: 765-784 (1 984)) parameter set with only the Ca coordinates fixed (i.e. all other atoms are allowed to 
move) (energy minimization is described below). This allowed any deviant bond lengths and angles to assume a stand- 
ard (chemically acceptable) geometry. See Table II. 

15 



Table II 



20 


Average Bond Lengths and Angles for "Average" (Before) and Energy-Minimized Consensus (After 50 Cycles) 

Structures 




V|_k before (A) 


V l k after (A) 


V H before (A) 


V H after (A) 


Standard Geome- 
try (A) 




N-Ca 


1.459(0.012) 


1.451(0.004) 


1 .451 (0.023) 


1.452(0.004) 


1.449 


25 


Ca-C 


1.515(0.012) 


1.523(0.005) 


1 .507(0.033) 


1.542(0.005) 


1.522 


0=C 


1.208(0.062) 


1.229(0.003) 


1.160(0.177) 


1.231(0.003) 


1.229 




C-N 


1 .288(0.049) 


1.337(0.002) 


1 .282(0.065) 


1 .335(0.004) 


1.335 




Ca-Cp 


1.508(0.026) 


1 .530(0.002) 


1 .499(0.039) 


1.530(0.002) 


1.526 


30 




C) 


C) 


n 


O 


C) 




C-N-Ca 


123.5(4.2) 


123.8(1.1) 


125.3(4.6) 


124.0(1.1) 


121.9 




N-Ca-C 


110.0(4.0) 


109.5(1.9) 


110.3(2.8) 


109.5(1.6) 


110.1 


35 


Ca-C-N 


116.6(4.0) 


116.6(1.2) 


117.6(5.2) 


116.6(0.8) 


116.6 




0=C-N 


123.1(4.1) 


123.4(0.6) 


122.2(4.9) 


123.3(0.4) 


122.9 




N-Ca-Cp 


110.3(2.1) 


109.8(0.7) 


110.6(2.5) 


109.8(0.6) 


109.5 




Cp-Ca-C 


111.4(2.4) 


111.1(0.7) 


1 1 1 .2(2.2) 


111.1(0.6) 


111.1 


40 


Values in parentheses are standard deviations. Note that while some bond length and angle averages did not 
change appreciably after energy-minimization, the corresponding standard deviations are reduced due to deviant 
geometries assuming standard values after energy-minimization. Standard geometry values are from the AMBER 
forcefield as implemented in DISCOVER (Biosym Technologies). 



45 

[0079] The consensus structure might conceivably be dependent upon which crystal structure was chosen as the 
template on which the others were superimposed. As a test, the entire procedure was repeated using the crystal struc- 
ture with the worst superposition versus 2FB4, i.e. the 2HFL Fab structure, as the new template (reference). The two 
consensus structures compare favorably (root-mean-squared deviation of 0.1 1 A for all N, Ca and C atoms). 

so [0080] Note that the consensus structure only includes mainchain (N, Ca, C, O. Cp atoms) coordinates for only those 
residues which are part of a conformation common to all seven X-ray crystal structures. For the Fab structures, these 
include the common p-strands (which comprise two p-sheets) and a few non-CDR loops which connect these p- 
strands. The consensus structure does not include CDRs or sidechains, both of which vary in their conformation among 
the seven structures. Also, note that the consensus structure includes only the VL and VH domains. 

55 [0081 ] This consensus structure is used as the archetype. It is not particular to any species, and has only the basic 
shape without side chains. Starting with this consensus structure the model of any import, human, or humanized Fab 
can be constructed as follows. Using the amino acid sequence of the particular antibody VL and VH domains of interest, 
a computer graphics program (such as INSIGHT, Biosym Technologies) is used to add sidechains and CDRs to the con- 
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sensus structure. When a sidechain is added, its conformation is chosen on the basis of known Fab crystal structures 
(see the Background section for publications of such crystal structures) and rotamer libraries (Ponder, J.W. & Richards, 
R M„ J. Mol. Biol. 193: 775-791 (1987)). The model also is constructed so that the atoms of the sidechain are posi- 
tioned so as to not collide with other atoms in the Fab. 

[0082] CDRs are added to the model (now having the backbone plus side chains) as follows. The size (i.e. number of 
amino acids) of each import CDR is compared to canonical CDR structures tabulated by Chothia et al., Nature, 
342:877-883 (1989)) and which were derived from Fab crystals. Each CDR sequence is also reviewed for the presence 
or absence of certain specific amino acid residues which are identified by Chothia as structurally important: e.g. light 
chain residues 29 (CDR1) and 95 (CDR3), and heavy chain residues 26, 27, 29 (CDR1) and 55 (CDR2). For light chain 
CDR2, and heavy chain CDR3, only the size of the CDR is compared to the Chothia canonical structure. If the size and 
sequence (i.e. inclusion of the specific, structurally important residues as denoted by Chothia et al.) of the import CDR 
agrees in size and has the same structurally important residues as those of a canonical CDR, then the mainchain con- 
formation of the import CDR in the model is taken to be the same as that of the canonical CDR. This means that the 
import sequence is assigned the structural configuration of the canonical CDR, which is then incorporated in the evolv- 
ing model. 

[0083] However, if no matching canonical CDR can be assigned for the import CDR, then one of two options can be 
exercised. First, using a program such as INSIGHT (Biosym Technologies), the Brookhaven Protein Data Bank can be 
searched for loops with a similar size to that of the import CDR and these loops can be evaluated as possible confor- 
mations for the import CDR in the model. Minimally, such loops must exhibit a conformation in which no loop atom over- 
laps with other protein atoms. Second, one can use available programs which calculate possible loop conformations, 
assuming a given loop size, using methods such as described by Bruccoleri et ai, Nature 335: 564-568 (1988). 
[0084] When all CDRs and sidechains have been added to the consensus structure to give the final model (import, 
human or humanized), the model is preferably subjected to energy minimization using programs which are available 
commercially (e.g. DISCOVER, Biosym Technologies). This technique uses complex mathematical formulae to refine 
the model by performing such tasks as checking that all atoms are within appropriate distances from one another and 
checking that bond lengths and angles are within chemically acceptable limits. 

[0085] Models of a humanized, import or human antibody sequence are used in the practice of this invention to under- 
stand the impact of selected amino acid residues of the activity of the sequence being modeled. For example, such a 
model can show residues which may be important in antigen binding, or for maintaining the conformation of the anti- 
body, as discussed in more detail below. Modeling can also be used to explore the potential impact of changing any 
amino acid residue in the antibody sequence. 

Methods for Obtaining a Humanized Antibody Sequence 

35 [0086] In the practice of this invention, the first step in humanizing an import antibody is deriving a consensus amino 
acid sequence into which to incorporate the import sequences. Next a model is generated for these sequences using 
the methods described above. In certain embodiments of this invention, the consensus human sequences are derived 
from the most abundant subclasses in the sequence compilation of Kabat et al. (Kabat, E. A. et ai, Sequences of Pro- 
teins of immunological Interest (National Institutes of Health, Bethesda, MD, 1987)), namely V L k subgroup I and V H 

40 group III, and have the sequences indicated in the definitions above. 

[0087] While these steps may be taken in different order, typically a structure for the candidate humanized antibody , 
is created by transferring the at least one CDR from the non-human, import sequence into the consensus human struc- 
ture, after the entire corresponding human CDR has been removed. The humanized antibody may contain human 
replacements of the non-human import residues at positions within CDRs as defined by sequence variability (Kabat, E. 

45 A. et at., Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD, 1987)) or as 
defined by structural variability (Chothia, C. & Lesk, A. M., J. Mol. Biol. 196:901-917 (1987)). For example, huMAb4D5 
contains human replacements of the muMAb4D5 residues at three positions within CDRs as defined by sequence var- 
iability (Kabat, E. A. et al., Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, 
MD, 1987)) but not as defined by structural variability (Chothia, C. & Lesk, A. M., J. Mol. Biol. 196:901 -91 7 (1987)): V L - 

50 CDR1 K24R, V L -CDR2 R54L and V L -CDR2 T56S. 

[0088] Differences between the non-human import and the human consensus framework residues are individually 
investigated to determine their possible influence on CDR conformation and/or binding to antigen. Investigation of such 
possible influences is desirably performed through modeling, by examination of the characteristics of the amino acids 
at particular locations, or determined experimentally through evaluating the effects of substitution or mutagenesis of 

55 particular amino acids. 

[0089] In certain preferred embodiments of this invention, a humanized antibody is made comprising amino acid 
sequence of an import, non-human antibody and a human antibody, utilizing the steps of: 
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a. obtaining the amino acid sequences of at least a portion of an import antibody variable domain and of a consen- 
sus human variable domain; 

b. identifying Complementarity Determining Region (CDR) amino acid sequences in the import and the human var- 
iable domain sequences; 

s c. substituting an import CDR amino acid sequence for the corresponding human CDR amino acid sequence; 

d. aligning the amino acid sequences of a Framework Region (FR) of the import antibody and the corresponding 
FR of the consensus antibody; 

e. identifying import antibody FR residues in the aligned FR sequences that are non-homologous to the corre- 
sponding consensus antibody residues; 

10 f. determining if the non-homologous import amino acid residue is reasonably expected to have at least one of the 
following effects: 

1 . non-covalently binds antigen directly, 

2. interacts with a CDR; or 

is 3. participates in the V L - V H interface; and 



g. for any non-homologous import antibody amino acid residue which is reasonably expected to have at least one 
of these effects, substituting that residue for the corresponding amino acid residue in the consensus antibody FR 
sequence. 

20 

[0090] Optionally, one determines if any non-homologous residues identified in step (e) are exposed on the surface 
of the domain or buried within it, and if the residue is exposed but has none of the effects identified in step (f), one may 
retain the consensus residue. 

[0091] Additionally, in certain embodiments the corresponding consensus antibody residues identified in step (e) 
25 above are selected from the group consisting of 4L, 35L, 36L, 38L, 43L, 44L, 46L, 58L, 62L, 63L, 64L, 65L, 66L, 67L, 
68L, 69L, 70L, 71 L, 73L, 85L, 87L, 98L, 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H. 67H, 68H, 69H, 70H, 
73H, 74H, 75H, 76H, 78H, 91 H, 92H, 93H, and 103H (utilizing the numbering system set forth in Kabat, E. A. et ai, 
Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD, 1987)). 
[0092] In preferred embodiments, the method of this invention comprises the additional steps of searching either or 
30 both of the import, non-human and the consensus variable domain sequences for glycosylation sites, determining if the 
glycosylation is reasonably expected to be important for the desired antigen binding and biological activity of the anti- 
body (i.e., determining if the glycosylation site binds to antigen or changes a side chain of an amino acid residue that 
binds to antigen, or if the glycosylation enhances or weakens antigen binding, or is important for maintaining antibody 
affinity). If the import sequence bears the glycosylation site, it is preferred to substitute that site for the corresponding 
35 residues in the consensus human sequence if the glycosylation site is reasonably expected to be important. If only the 
consensus sequence, and not the import, bears the glycosylation site, it is preferred to eliminate that glycosylation site 
or substitute therefor the corresponding amino acid residues from the import sequence. 

[0093] Another preferred embodiment of the methods of this invention comprises aligning import antibody and the 
consensus antibody FR sequences, identifying import antibody FR residues which are non-homologous with the 
40 aligned consensus FR sequence, and for each such non-homologous import antibody FR residue, determining if the 
corresponding consensus antibody residue represents a residue which is highly conserved across all species at that 
site, and if it is so conserved, preparing a humanized antibody which comprises the consensus antibody amino acid 
residue at that site. 

[0094] In certain alternate embodiments, one need not utilize the modeling and evaluation steps described above, 
45 and may instead proceed with the steps of obtaining the amino acid sequence of at least a portion of an import, non- 
human antibody variable domain having a CDR and a FR, obtaining the amino acid sequence of at least a portion of a 
consensus human antibody variable domain having a CDR and a FR, substituting the non-human CDR for the human 
CDR in the consensus human antibody variable domain, and then substituting an amino acid residue for the consensus 
amino acid residue at at least one of the following sites: 

50 

a. (in the FR of the variable domain of the light chain) 4L, 35L, 36L, 38L, 43L, 44L, 58L, 46L, 62L, 63L, 64L, 65L, 
66L, 67L, 68L, 69L, 70L, 71 L, 73L, 85L, 87L, 98L, or 

b. (in the FR of the variable domain of the heavy chain) 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H, 
67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 91 H, 92H, 93H, and 103H. 

55 

Preferably, the non-CDR residue substituted at the consensus FR site is the residue found at the corresponding location 
of the non-human antibody. If desired, one may utilize the other method steps described above for determining whether 
a particular amino acid residue can reasonably be expected to have undesirable effects, and remedying those effects. 
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[0095] If after making a humanized antibody according to the steps above and testing its activity one is not satisfied 
with the humanized antibody, one preferably reexamines the potential effects of the amino acids at the specific locations 
recited above. Additionally, it is desirable to reinvestigate any buried residues which are reasonably expected to affect 
the V L - V H interface but may not directly affect CDR conformation. It is also desirable to reevaluate the humanized anti- 
5 body utilizing the steps of the methods claimed herein. 

[0096] In certain embodiments of this invention, amino acid residues in the consensus human sequence are substi- 
tuted for by other amino acid residues. In preferred embodiments, residues from a particular non-human import 
sequence are substituted, however there are circumstances where it is desired to evaluate the effects of other amino 
acids. For example, if after making a humanized antibody according to the steps above and testing its activity one is not 
10 satisfied with the humanized antibody, one may compare the sequences of other classes or subgroups of human anti- 
bodies, or classes or subgroups of antibodies from the particular non-human species, and determine which other amino 
acid side chains and amino acid residues are found at particular locations and substituting such other residues. 

Antibodies 

15 

[0097] Certain aspects of this invention are directed to natural antibodies and to monoclonal antibodies, as illustrated 
in the Examples below and by antibody hybridomas deposited with the ATCC (as described below). Thus, the refer- 
ences throughout this description to the use of monoclonal antibodies are intended to include the use of natural or 
native antibodies as well as humanized and chimeric antibodies. As used herein, the term "antibody" includes the anti- 

20 body variable domain and other separable antibody domains unless specifically excluded. 

[0098] In accordance with certain aspects of this invention, antibodies to be humanized (import antibodies) are iso- 
lated from continuous hybrid cell lines formed by the fusion of antigen-primed immune lymphocytes with myeloma cells. 
[0099] In certain embodiments, the antibodies of this invention are obtained by routine screening. Polyclonal antibod- 
ies to an antigen generally are raised in animals by multiple subcutaneous (sc) or intraperitoneal (ip) injections of the 

25 antigen and an adjuvant. It may be useful to conjugate the antigen or a fragment containing the target amino acid 
sequence to a protein that is immunogenic in the species to be immunized, e.g., keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, or soybean trypsin inhibitor using a Afunctional or derivatizing agent, for example, male- 
imidobenzoyl sulfosuccinimide ester (conjugation through cysteine residues), N-hydroxysuccinimide (through lysine 
residues), glutaraldehyde, succinic anhydride, SOCI 2 , or R 1 N = C = NR, where R and R 1 are different alkyl groups. 

30 [0100] The route and schedule of the host animal or cultured antibody-producing cells therefrom are generally in 
keeping with established and conventional techniques for antibody stimulation and production. While mice are fre- 
quently employed as the test model, it is contemplated that any mammalian subject including human subjects or anti- 
body-producing cells obtained therefrom can be manipulated according to the processes of this invention to serve as 
the basis for production of mammalian, including human, hybrid cell lines. 

35 [0101] Animals are typically immunized against the immunogenic conjugates or derivatives by combining 1 mg or 1 
^g of conjugate (for rabbits or mice, respectively) with 3 volumes of Freund's complete adjuvant and injecting the solu- 
tion intradermal^ at multiple sites. One month later the animals are boosted with 1/5 to 1/10 the original amount of con- 
jugate in Freund's complete adjuvant (or other suitable adjuvant) by subcutaneous injection at multiple sites. 7 to 14 
days later animals are bled and the serum is assayed for antigen titer. Animals are boosted until the titer plateaus. Pref- 

40 erably, the animal is boosted with the conjugate of the same antigen, but conjugated to a different protein and/or through 
a different cross-linking agent. Conjugates also can be made in recombinant cell culture as protein fusions. Also, aggre- 
gating agents such as alum are used to enhance the immune response. 

[0102] After immunization, monoclonal antibodies are prepared by recovering immune lymphoid cells-typically spleen 
cells or lymphocytes from lymph node tissue-from immunized animals and immortalizing the cells in conventional fash- 
45 ion, e.g. by fusion with myeloma cells or by Epstein-Barr (EB)-virus transformation and screening for clones expressing 
the desired antibody. The hybridoma technique described originally by Kohler and Milstein, Eur. J, Immunol. 6:511 
(1976) has been widely applied to produce hybrid cell lines that secrete high levels of monoclonal antibodies against 
many specific antigens. 

[0103] It is possible to fuse cells of one species with another. However, it is preferable that the source of the immu- 

so nized antibody producing cells and the myeloma be from the same species. 

[0104] The hybrid cell lines can be maintained in culture in vitro in cell culture media. The cell lines of this invention 
can be selected and/or maintained in a composition comprising the continuous cell line in hypoxanthine-aminopterin 
thymidine (HAT) medium. In fact, once the hybridoma cell line is established, it can be maintained on a variety of nutri- 
tionally adequate media. Moreover, the hybrid cell lines can be stored and preserved in any number of conventional 

55 ways, including freezing and storage under liquid nitrogen. Frozen cell lines can be revived and cultured indefinitely with 
resumed synthesis and secretion of monoclonal antibody. The secreted antibody is recovered from tissue culture super- 
natant by conventional methods such as precipitation, Ion exchange chromatography, affinity chromatography, or the 
like. The antibodies described herein are also recovered from hybridoma cell cultures by conventional methods for puri- 
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fication of IgG or IgM as the case may be that heretofore have been used to purify these immunoglobulins from pooled 
plasma, e.g. ethanol or polyethylene glycol precipitation procedures. The purified antibodies are sterile filtered, and 
optionally are conjugated to a detectable marker such as an enzyme or spin label for use in diagnostic assays of the 
antigen in test samples. 

[0105] While routinely rodent monoclonal antibodies are used as the source of the import antibody, the invention is 
not limited to any species. Additionally, techniques developed for the production of chimeric antibodies (Morrison et al, 
Proa Natl. Acad. ScL, 81 :6851 (1984); Neuberger era/., Nature 312:604 (1984); Takeda et a/., Nature 314:452 (1985)) 
by splicing the genes from a mouse antibody molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity (such as ability to activate human complement and mediate 
ADCC) an be used; such antibodies are within the scope of this invention. 

[0106] Techniques for creating recombinant DNA versions of the antigen-binding regions of antibody molecules 
(known as Fab fragments) which bypass the generation of monoclonal antibodies are encompassed within the practice 
of this invention. One extracts antibody-specific messenger RNA molecules from immune system cells taken from an 
immunized animal, transcribes these into complementary DNA (cDNA), and clones the cDNA into a bacterial expres- 
sions system. One example of such a technique suitable for the practice of this invention was developed by researchers 
at Scripps/Stratagene, and incorporates a proprietary bacteriophage lambda vector system which contains a leader 
sequence that causes the expressed Fab protein to migrate to the periplasmic space (between the bacterial cell mem- 
brane and the cell wall) or to be secreted. One can rapidly generate and screen great numbers of functional FAb frag- 
ments for those which bind the antigen. Such FAb fragments with specificity for the antigen are specifically 
encompassed within the term "antibody" as it is defined, discussed, and claimed herein. 

Amino Acid Sequence Variants 

[0107] Amino acid sequence variants of the antibodies and polypeptides of this invention (referred to in herein as the 
25 target polypeptide) are prepared by introducing appropriate nucleotide changes into the DNA encoding the target 
polypeptide, or by in vitro synthesis of the desired target polypeptide. Such variants include, for example, humanized 
variants of non-human antibodies, as well as deletions from, or insertions or substitutions of, residues within particular 
amino acid sequences. Any combination of deletion, insertion, and substitution can be made to arrive at the final con- 
struct, provided that the final construct possesses the desired characteristics. The amino acid changes also may alter 
30 post-translational processes of the target polypeptide, such as changing the number or position of glycosylation sites, 
altering any membrane anchoring characteristics, and/or altering the intra-cellular location of the target polypeptide by 
inserting, deleting, or otherwise affecting any leader sequence of the native target polypeptide. 
[0108] In designing amino acid sequence variants of target polypeptides, the location of the mutation site and the 
nature of the mutation will depend on the target polypeptide characteristic(s) to be modified. The sites for mutation can 
35 be modified individually or in series, e.g., by (1) substituting first with conservative amino acid choices and then with 
more radical selections depending upon the results achieved, (2) deleting the target residue, or (3) inserting residues 
of the same or a different class adjacent to the located site, or combinations of options 1-3. In certain embodiments, 
these choices are guided by the methods for creating humanized sequences set forth above. 
[0109] A useful method for identification of certain residues or regions of the target polypeptide that ore preferred loca- 
te tions for mutagenesis is called "alanine scanning mutagenesis" as described by Cunningham and Wells (Science, 244: 
1081-1085 [1989]). Here, a residue or group of target residues am identified (e.g., charged residues such as arg, asp, 
his, lys, and glu) and replaced by a neutral or negatively charged amino acid (most preferably alanine or polyalanine) 
to affect the interaction of the amino acids with the surrounding aqueous environment in or outside the cell. Those 
domains demonstrating functional sensitivity to the substitutions then are refined by introducing further or other variants 
45 at or for the sites of substitution. Thus, while the site for introducing an amino acid sequence variation is predetermined, 
the nature of the mutation per se need not be predetermined. For example, to optimize the performance of a mutation 
at a given site, ala scanning or random mutagenesis may be conducted at the target codon or region and the expressed 
target polypeptide variants are screened for the optimal combination of desired activity. 

[0110] There are two principal variables in the construction of amino acid sequence variants: the location of the muta- 
50 tion site and the nature of the mutation. In general, the location and nature of the mutation chosen will depend upon the 
target polypeptide characteristic to be modified. 

[01 1 1 ] Amino acid sequence deletions of antibodies are generally not preferred, as maintaining the generally config- 
uration of an antibody is believed to be necessary for its activity. Any deletions will be selected so as to preserve the 
structure of the target antibody. 
55 [0112] Amino acid sequence insertions include amino- and/or carboxyl-terminal fusions ranging in length from one 
residue to polypeptides containing a hundred or more residues, as well as intrasequence insertions of single or multiple 
amino acid residues. Intrasequence insertions (i.e., insertions within the target polypeptide sequence) may range gen- 
erally from about 1 to 10 residues, more preferably 1 to 5, most preferably 1 to 3. Examples of terminal insertions 
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include the target polypeptide with an N-terminal methionyl residue, an artifact of the direct expression of target 
polypeptide in bacterial recombinant cell culture, and fusion of a heterologous N-terminal signal sequence to the N-ter- 
minus of the target polypeptide molecule to facilitate the secretion of the mature target polypeptide from recombinant 
host cells. Such signal sequences generally will be obtained from, and thus homologous to, the intended host cell spe- 
5 cies. Suitable sequences include STII or Ipp for E. cofi, alpha factor for yeast, and viral signals such as herpes gD for 
mammalian cells. 

[0113J Other insertional variants of the target polypeptide include the fusion to the N- or C-terminus of the target 
polypeptide of immunogenic polypeptides, e.g., bacterial polypeptides such as beta-lactamase or an enzyme encoded 
by the E. coli trp locus, or yeast protein, and C-terminal fusions with proteins having a long half-life such as immu- 
10 noglobulin constant regions (or other immunoglobulin regions), albumin, or ferritin, as described in WO 89/02922 pub- 
lished 6 ApriM 989. 

[0114] Another group of variants are amino acid substitution variants. These variants have at least one amino acid 
residue in the target polypeptide molecule removed and a different residue inserted in its place. Hie sites of greatest 
interest for substitutional mutagenesis include sites identified as the active site(s) of the target polypeptide, and sites 

is where the amino acids found in the target polypeptide from various species are substantially different in terms of side- 
chain bulk, charge, and/or hydrophobicity. Other sites for substitution are described infra, considering the affect of the 
substitution of the antigen binding, affinity and other characteristics of a particular target antibody. 
[0115] Other sites of interest are those in which particular residues of the target polypeptides obtained from various 
species are identical. These positions may be important for the biological activity of the target polypeptide. Those sites, 

20 especially those falling within a sequence of at least three other identically conserved sites, are substituted in a rela- 
tively conservative manner. If such substitutions result in a change in biological activity, then other changes are intro- 
duced and the products screened until the desired effect is obtained. 

[0116] Substantial modifications in function or immunological identity of the target polypeptide are accomplished by 
selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide backbone 
25 in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the 
molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues are divided into groups based 
on common side chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 
30 (2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

35 

[01 1 7] Non-conservative substitutions will entail exchanging a member of one of these classes for another. Such sub- 
stituted residues may be introduced into regions of the target polypeptide that are homologous with other antibodies of 
the same class or subclass, or, more preferably, into the non-homologous regions of the molecule. 
[0118] Any cysteine residues not involved in maintaining the proper conformation of target polypeptide also may be 

40 substituted, generally with serine, to improve the oxidative stability of the molecule and prevent aberrant crosslinking. 
[0119] DNA encoding amino acid sequence variants of the target polypeptide is prepared by a variety of methods 
known in the art. These methods include, but are not limited to, isolation from a natural source (in the case of naturally 
occurring amino acid sequence variants) or preparation by oligonucleotide-mediated (or site-directed) mutagonosis, 
PCR mutagenesis, and cassette mutagenesis of an earlier prepared variant or a non-variant version of the target 

45 polypeptide. A particularly preferred method of gene conversion mutagenesis is described below in Example 1 . These 
techniques may utilized target polypeptide nucleic acid (DNA or RNA), or nucleic acid complementary to the target 
polypeptide nucleic acid. 

[01 20) Oligonucleotide-mediated mutagenesis is a preferred method for preparing substitution, deletion, and insertion 
variants of target polypeptide DNA. This technique is well known in the art as described by Adelman et ai, DNA. 2: 183 

50 (1983). Briefly, the target polypeptide DNA is altered by hybridizing an oligonucleotide encoding the desired mutation to 
a DNA template, where the template is the single-stranded form of a plasmid or bacteriophage containing the unaltered 
or native DNA sequence of the target polypeptide. After hybridization, a DNA polymerase is used to synthesize an 
entire second complementary strand of the template that will thus incorporate the oligonucleotide primer, and will code 
for the selected alteration in the target polypeptide DNA. 

55 [0121] Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide will have 
12 to 15 nucleotides that are completely complementary to the template on either side of the nucleotide(s) coding for 
the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA template mole- 
cule. The oligonucleotides are readily synthesized using techniques known in the art such as that described by Crea et 
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ai ( Proc. Natl. Acad. Sri. USA . 75: 5765 [1978]). 

[0122] Single-stranded DNA template may also be generated by denaturing double-stranded plasmid (or other) DNA 
using standard techniques. 

[0123] For alteration of the native DNA sequence (to generate amino acid sequence variants, for example), the oligo- 

5 nucleotide is hybridized to the single-stranded template under suitable hybridization conditions. A DNA polymerizing 
enzyme, usually the Klenow fragment of DNA polymerase I, is then added to synthesize the complementary strand of 
the template using the oligonucleotide as a primer for synthesis. A heteroduplex molecule is thus formed such that one 
strand of DNA encodes the mutated form of the target polypeptide, and the other strand (the original template) encodes 
the native, unaltered sequence of the target polypeptide. This heteroduplex molecule is then transformed into a suitable 

10 host cell, usually a prokaryote such as E colt JM1 01 . After the cells are grown, they are plated onto agarose plates and 
screened using the oligonucleotide primer radiolabeled with 32-phosphate to identify the bacterial colonies that contain 
the mutated DNA. The mutated region is then removed and placed in an appropriate vector for protein production, gen- 
erally an expression vector of the type typically employed for transformation of an appropriate host. 
[0124] The method described immediately above may be modified such that a homoduplex molecule is created 

75 wherein both strands of the plasmid contain the mutation(s). The modifications are as follows: The single-stranded oli- 
gonucleotide is annealed to the single-stranded template as described above. A mixture of three deoxyribonucleotides, 
deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with a modi- 
fied thio-deoxyribocytosine called dCTP-(aS) (which can be obtained from Amersham Corporation). This mixture is 
added to the template-oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a strand of DNA 

20 identical to the template except for the mutated bases is generated. In addition, this new strand of DNA will contain 
dCTP-(aS) instead of dCTP, which serves to protect it from restriction endonuclease digestion. 
[01 25] After the template strand of the double-stranded heteroduplex is nicked with an appropriate restriction enzyme, 
the template strand can be digested with Exo lH nuclease or another appropriate nuclease past the region that contains 
the site(s) to be mutagenized. The reaction is then stopped to leave a molecule that is only partially single-stranded. A 

25 complete double-stranded DNA homoduplex is then formed using DNA polymerase in the presence of all four deoxyri- 
bonucleotide triphosphates, ATP, and DNA ligase. This homoduplex molecule can then be transformed into a suitable 
host cell such as E cofi JM101 , as described above. 

[0126] DNA encoding target polypeptide variants with more than one amino acid to be substituted may be generated 
in one of several ways. If the amino acids are located close together in the polypeptide chain, they may be mutated 
30 simultaneously using one oligonucleotide that codes for all of the desired amino acid substitutions, if, however, the 
amino acids are located some distance from each other (separated by more than about ten amino acids), it is more dif- 
ficult to generate a single oligonucleotide that encodes all of the desired changes. Instead, one of two alternative meth- 
ods may be employed. 

[0127] In the first method, a separate oligonucleotide is generated for each amino acid to be substituted. The oligo- 
35 nucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand of DNA that 
is synthesized from the template will encode all of the desired amino acid substitutions. 

[0128] The alternative method involves two or more rounds of mutagenesis to produce the desired mutant. The first 
round is as described for the single mutants: wild-type DNA is used for the template, an oligonucleotide encoding the 
first desired amino acid substitution(s) is annealed to this template, and the heteroduplex DNA molecule is then gener- 

40 ated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of mutagenesis as the 
template. Thus, this template already contains one or more mutations. The oligonucleotide encoding the additional 
desired amino acid substitution(s) is then annealed to this template, and the resulting strand of DNA now encodes 
mutations from both the first and second rounds of mutagenesis. This resultant DNA can be used as a template in a 
third round of mutagenesis, and so on. 

45 [01 29] PCR mutagenesis is also suitable for making amino acid variants of target polypeptide. While the following dis- 
cussion refers to DNA, it is understood that the technique also finds application with RNA. The PCR technique generally 
refers to the following procedure (see Erlich, supra, the chapter by R. Higuchi, p. 61-70): When small amounts of tem- 
plate DNA are used as starting material in a PCR, primers that differ slightly in sequence from the corresponding region 
in a template DNA can be used to generate relatively large quantities of a specific DNA fragment that differs from the 

so template sequence only at the positions where the primers differ from the template. For introduction of a mutation into 
a plasmid DNA. one of the primers is designed to overlap the position of the mutation and to contain the mutation; the 
sequence of the other primer must be identical to a stretch of sequence of the opposite strand of the plasmid, but this 
sequence can be located anywhere along the plasmid DNA. It is preferred, however, that the sequence of the second 
primer is located within 200 nucleotides from that of the first, such that in the end the entire amplified region of DNA 

55 bounded by the primers can be easily sequenced. PCR amplification using a primer pair like the one just described 
results in a population of DNA fragments that differ at the position of the mutation specified by the primer, and possibly 
at other positions, as template copying is somewhat error-prone. 

[0130] If the ratio of template to product material is extremely low, the vast majority of product DNA fragments incor- 



20 



EP 0 940 468 A1 

porate the desired mutation(s). This product material is used to replace the corresponding region in the plasmid that 
served as PCR template using standard DNA technology. Mutations at separate positions can be introduced simulta- 
neously by either using a mutant second primer, or performing a second PCR with different mutant primers and ligating 
the two resulting PCR fragments simultaneously to the vector fragment in a three (or more)-part ligation. 

5 [0131] In a specific example of PCR mutagenesis, template plasmid DNA (1 ng) is linearized by digestion with a 
restriction endonuclease that has a unique recognition site in the plasmid DNA outside of the region to be amplified. Of 
this material, 100 ng is added to a PCR mixture containing PCR buffer, which contains the four deoxynucleotide tri- 
phosphates and is included in the GeneAmp® kits (obtained from Perkin-Elmer Cetus. Norwalk, CT and Emeryville, 
CA), and 25 pmole of each oligonucleotide primer, to a final volume of 50 til. The reaction mixture is overlayed with 35 

w |xl mineral oil. The reaction is denatured for 5 minutes at 100°C, placed briefly on ice, and then 1 \i\ Thermus aquaticus 
(Taq) DNA polymerase (5 units/^l, purchased from Perkin-Elmer Cetus, Norwalk, CT and Emeryville, CA) is added 
below the mineral oil layer. The reaction mixture is then inserted into a DNA Thermal Cycler (purchased from Perkin- 
Elmer Cetus) programmed as follows: 2 min. at 55°C, then 30 sec. at 72°C, then 19 cycles of the following: 30 sec. at 
' 94°C, 30 sec. at 55°C, and 30 sec. at 72°C. 

75 [01 32 J At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous phase trans- 
ferred to a new vial, extracted with phenol/chloroform (50:50:vol), and ethanol precipitated, and the DNA is recovered 
by standard procedures. This material is subsequently subjected to the appropriate treatments for insertion into a vec- 
tor. 

[0133] Another method for preparing variants, cassette mutagenesis, is based on the technique described by Wells 
20 et ai. (Gene. 34: 315 [1985]). The starting material is the plasmid (or other vector) comprising the target polypeptide 
DNA to be mutated. The codon(s) in the target polypeptide DNA to be mutated are identified. There must be a unique 
restriction endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, they may 
be generated using the above-described oligonucleotide-mediated mutagenesis method to introduce them at appropri- 
ate locations in the target polypeptide DNA. After the restriction sites have been introduced into the plasmid, the plas- 
25 mid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the sequence of the DNA between 
the restriction sites but containing the desired mutation[s) is synthesized using standard procedures. The two strands 
are synthesized separately and then hybridized together using standard techniques. This double-stranded oligonucle- 
otide is referred to as the cassette. This cassette is designed to have 3' and 5' ends that are compatible with the ends 
of the linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now contains the mutated tar- 
so get polypeptide DNA sequence. 

Insertion of DNA into a Cloning Vehicle 

[01 34] The cDNA or genomic DNA encoding the target polypeptide is inserted into a replicable vector for further clon- 
35 ing (amplification of the DNA) or for expression. Many vectors are available, and selection of the appropriate vector will 
depend on 1) whether it is to be used for DNA amplification or for DNA expression, 2) the size of the DNA to be inserted 
into the vector, and 3) the host cell to be transformed with the vector. Each vector contains various components depend- 
ing on its function (amplification of DNA or expression of DNA) and the host cell for which it is compatible. The vector 
components generally include, but are not limited to, one or more of the following: a signal sequence, an origin of rep- 
40 lication, one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. 

(a) Signal Sequence Component 

[0135] In general, the signal sequence may be a component of the vector, or it may be a part of the target polypeptide 

45 DNA that is inserted into the vector. 

[0136] The target polypeptides of this invention may be expressed not only directly, but also as a fusion with a heter- 
ologous polypeptide, preferably a signal sequence or other polypeptide having a specific cleavage site at the N-termi- 
nus of the mature protein or polypeptide. In general, the signal sequence may be a component of the vector, or it may 
be a part of the target polypeptide DNA that is inserted into the vector. Included within the scope of this invention are 

so target polypeptides with any native signal sequence deleted and replaced with a heterologous signal sequence. The 
heterologous signal sequence selected should be one that is recognized and processed (i.e. cleaved by a signal pepti- 
dase) by the host cell. For prokaryotic host cells that do not recognize and process the native target polypeptide signal 
sequence, the signal sequence is substituted by a prokaryotic signal sequence selected, for example, from the group 
of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion the native target 

55 polypeptide signal sequence may be substituted by the yeast invertase, alpha factor, or acid phosphatase leaders. In 
mammalian cell expression the native signal sequence is satisfactory, although other mammalian signal sequences 
may be suitable. 
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(b) Origin of Replication Component 

[0137] Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate in 
one or more selected host cells. Generally, in cloning vectors this sequence is one that enables the vector to replicate 

5 independently of the host chromosomal DNA, and includes origins of replication or autonomously replicating 
sequences. Such sequences are well known for a variety of bacteria, yeast, and viruses. The origin of replication from 
the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2\i plasmid origin is suitable for yeast, and various 
viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in mammalian cells. Generally, the 
origin of replication component is not needed for mammalian expression vectors (the SV40 origin may typically be used 

10 only because it contains the early promoter). 

[0138] Most expression vectors are "shuttle" vectors, i.e. they are capable of replication in at least one class of organ- 
isms but can be transfected into another organism for expression. For example, a vector is cloned in E. coli and then 
the same vector is transfected into yeast or mammalian cells for expression even though it is not capable of replicating 
independently of the host cell chromosome. 

is [01 39] DNA may also be amplified by insertion into the host genome. This is readily accomplished using Bacillus spe- 
cies as hosts, for example, by including in the vector a DNA sequence that is complementary to a sequence found in 
Bacillus genomic DNA. Transfection of Bacillus with this vector results in homologous recombination with the genome 
and insertion of the target polypeptide DNA. However, the recovery of genomic DNA encoding the target polypeptide is 
more complex than that of an exogenously replicated vector because restriction enzyme digestion is required to excise 

20 the target polypeptide DNA. 

(c) Selection Gene Component 

[0140] Expression and cloning vectors should contain a selection gene, also termed a selectable marker. This gene 

25 encodes a protein necessary for the survival or growth of transformed host cells grown in a selective culture medium. 
Host cells not transformed with the vector containing the selection gene will not survive in the culture medium. Typical 
selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g. ampicillin, neomycin, meth- 
otrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients not available from com- 
plex media, e.g. the gene encoding D-alanine racemase for Bacilli 

30 [0141 ] One example of a selection scheme utilizes a drug to arrest growth of a host cell. Those cells that are success- 
fully transformed with a heterologous gene express a protein conferring drug resistance and thus survive the selection 
regimen. Examples of such dominant selection use the drugs neomycin (Southern et a/., J. Molec. Appl. Genet.. . 1: 327 
[1982]), mycophenolic acid (Mulligan et ai t Science. 209 : 1422[1980]) or hygromycin (Sugden et ai, Mol. Cell. Biol. . 5: 
410-413 [1985]). The three examples given above employ bacterial genes under eukaryotic control to convey resist- 

35 ance to the appropriate drug G41 8 or neomycin (geneticin), xgpt (mycophenolic acid), or hygromycin, respectively. 
[0142] Another example of suitable selectable markers for mammalian cells are those that enable the identification of 
cells competent to take up the target polypeptide nucleic acid, such as dihydrofolate reductase (DHFR) or thymidine 
kinase. The mammalian cell transformants are placed under selection pressure which only the transformants are 
uniquely adapted to survive by virtue of having taken up the marker. Selection pressure is imposed by culturing the 

40 transformants under conditions in which the concentration of selection agent in the medium is successively changed, 
thereby leading to amplification of both the selection gene and the DNA that encodes the target polypeptide. Amplifica- 
tion is the process by which genes in greater demand for the production of a protein critical for growth are reiterated in 
tandem within the chromosomes of successive generations of recombinant cells. Increased quantities of the target 
polypeptide are synthesized from the amplified DNA. 

45 [0143] For example, cells transformed with the DHFR selection gene are first identified by culturing ail of the trans- 
formants in a culture medium that contains methotrexate (Mtx), a competitive antagonist of DHFR. An appropriate host 
cell when wild-type DHFR is employed is the Chinese hamster ovary (CHO) cell line deficient in DHFR activity, prepared 
and propagated as described by Urlaub and Chasin, Proc. Natl. Acad. Set. USA. 77: 4216 [1980]. The transformed cells 
are then exposed to increased levels of methotrexate. This leads to the synthesis of multiple copies of the DHFR gene, 

so and, concomitantly, multiple copies of other DNA comprising the expression vectors, such as the DNA encoding the tar- 
get polypeptide. This amplification technique can be used with any otherwise suitable host, e.g., ATCC No. CCL61 
CHO-K1 , notwithstanding the presence of endogenous DHFR if, for example, a mutant DHFR gene that is highly resist- 
ant to Mtx is employed (EP 117,060). Alternatively, host cells (particularly wild-type hosts that contain endogenous 
DHFR) transformed or co-transformed with DNA sequences encoding the target polypeptide, wild-type DHFR protein, 

55 and another selectable marker such as aminoglycoside 3' phosphotransferase (APH) can be selected by cell growth in 
medium containing a selection agent for the selectable marker such as an aminoglycosidic antibiotic, e.g., kanamycin, 
neomycin, or G418. See U.S. Pat. No. 4,965,199. 

[0144] A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 [Stinchcomb et 
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al, Nature. 282 : 39 [1979]; Kingsman etai, Gene. 7: 141 [1979]; or Tschemper etal, Gene. 10: 157 [1980]). The frp1 
gene provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, 
ATCC No. 44076 or PEP4-1 (Jones, Genetics. S5: 12 [1977]). The presence of the trp1 lesion in the yeast host cell 
genome then provides an affective environment for detecting transformation by growth in the absence of tryptophan. 
5 Similarly, Let;2-deficient yeast strains (ATCC 20.622 or 38,626) are complemented by known plasmids bearing the 
Leu2 gene. 

(d) Promoter Component 

io [0145] Expression and cloning vectors usually contain a promoter that is recognized by the host organism and is oper- 
ably linked to the target polypeptide nucleic acid. Promoters are untranslated sequences located upstream (5') to the 
start codon of a structural gene (generally within about 100 to 1000 bp) that control the transcription and translation of 
a particular nucleic acid sequence, such as that encoding the target polypeptide, to which they are operably linked. 
Such promoters typically fall into two classes, inducible and constitutive. Inducible promoters are promoters that initiate 

75 increased levels of transcription from DNA under their control in response to some change in culture conditions, e.g. the 
presence or absence of a nutrient or a change in temperature. At this time a large number of promoters recognized by 
a variety of potential host cells are well known. These promoters are operably linked to DNA encoding the target 
polypeptide by removing the promoter from the source DNA by restriction enzyme digestion and inserting the isolated 
promoter sequence into the vector. Both the native target polypeptide promoter sequence and many heterologous pro- 

20 moters may be used to direct amplification and/or expression of the target polypeptide DNA. However, heterologous 
promoters are preferred, as they generally permit greater transcription and higher yields of expressed target polypep- 
tide as compared to the native target polypeptide promoter. 

[0146] Promoters suitable for use with prokaryotic hosts include the p-lactamase and lactose promoter systems 
(Chang et al, Nature. 275 : 615 [1978]; and Goeddel et al., Nature, 281: 544 [1979]), alkaline phosphatase, a tryp- 

25 tophan (trp) promoter system (Goeddel, Nucleic Acids Res., fi: 4057 [1 980] and EP 36,776) and hybrid promoters such 
as the tac promoter (deBoer et al, Proc. Natl. Acad. Sci. USA. §0: 21 -25 [1983]). However, other known bacterial pro- 
moters are suitable. Their nucleotide sequences have been published, thereby enabling a skilled worker operably to 
ligate them to DNA encoding the target polypeptide (Siebenlist et al, Cfill, 2Q: 269 [1980] using linkers or adaptors to 
supply any required restiction sites. Promoters for use in bacterial systems also generally will contain a Shine-Dalgamo 

30 (S.D.) sequence operably linked to the DNA encoding the target polypeptide. 

[0147] Suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase 
(Hitzeman et al, J. Biol. Chem.. 255 : 2073 [1980]) or other glycolytic enzymes (Hess et al, J. Adv. Enzvme Rea. . 7: 
149 [1968]; and Holland, Biochemistry. 17: 4900(1978]). such as enolase, glyceraldehyde-3-phosphate dehydroge- 
nase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycer- 

35 ate mutase. pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. 

[0148] Other yeast promoters, which are inducible promoters having the additional advantage of transcription control- 
led by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, 
degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate dehydroge- 
nase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for use in yeast 

40 expression are further described in Hitzeman er al, EP 73.657A. Yeast enhancers also are advantageously used with 
yeast promoters. 

[0149] Promoter sequences are known for eukaryotes. Virtually all eukaryotic genes have an AT-rich region located 
approximately 25 to 30 bases upstream from the site where transcription is initiated. Another sequence found 70 to 80 
bases upstream from the start of transcription of many genes is a CXCAAT region where X may be any nucleotide. At 

45 the 3* end of most eukaryotic genes is an AATAAA sequence that may be the signal for addition of the poly A tail to the 
3' end of the coding sequence. All of these sequences are suitably inserted into mammalian-expression vectors. 
[01 50] Target polypeptide transcription from vectors in mammalian host cells is controlled by promoters obtained from 
the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,21 1 ,504 published 5 July 1989), adenovirus (such 
as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus. hepatitis-B virus and most 

so preferably Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g. the actin promoter or an immu- 
noglobulin promoter, from heat-shock promoters, and from the promoter normally associated with the target polypep- 
tide sequence, provided such promoters are compatible with the host cell systems. 

[01 51 ] The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment that 
also contains the SV40 viral origin of replication. Fiers et al, Nature. 273:113 (1978); Mulligan and Berg, Science. 209: 
55 1422-1427 (1980); PavlaWs et al, Proc. Natl. Acad. Sci. USA. 73: 7398-7402 (1981). The immediate early promoter of 
the human cytomegalovirus is conveniently obtained as a Hindlll E restriction fragment. Greenaway et al, Gene . 18: 
355-360 (1982). A system for expressing DNA in mammalian hosts using the bovine papilloma virus as a vector is dis- 
closed in U.S. 4,419,446. A modification of this system is described in U.S. 4,601 ,978. See alsoGray era/., Nature, 235: 
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503-508 (1982) on expressing cDNA encoding immune interferon in monkey calls; Reyes et al., Nature. 297 : 598-601 
(1982) on expression of human p-interferon cDNA in mouse cells under the control of a thymidine kinase promoter from 
herpes simplex virus, Canaani and Berg, Proc. Natl. Acad. Sci. USA. Z2: 51 66-51 70 (1 982) on expression of the human 
interferon p1 gene in cultured mouse and rabbit cells, and Gorman et al., Proc. Natl. Acad. Sci. USA . 73: 6777-6781 
5 (1982) on expression of bacterial CAT sequences in CV-1 monkey kidney cells, chicken embryo fibroblasts, Chinese 
hamster ovary cells, HeLa cells, and mouse NIH-3T3 cells using the Rous sarcoma virus long terminal repeat as a pro- 
moter. 

(e) Enhancer Element Component 

10 

[0152] Transcription of DNA encoding the target polypeptide of this invention by higher eukaryotes is often increased 
by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10- 
300 bp, that act on a promoter to increase its transcription. Enhancers are relatively orientation and position independ- 
ent having been found 5' (Laimins et ai, Proc. Natl. Acad. Sci. USA . 73: 993 [1 981]) and 3' (Lusky et al., Mol. Cell Bio.. 

15 3: 1 108 [1983]) to the transcription unit, within an intron (Banerji et al., Cell. 33: 729 [1983]) as well as within the coding 
sequence itself (Osborne et al., Mol. Cell Bio.. 4: 1293 [1984]). Many enhancer sequences are now known from mam- 
malian genes (globin, elastase, albumin, a-fetoprotein and insulin). Typically, however, one will use an enhancer from a 
eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 100-270), the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adeno- 

20 virus enhancers. See also Yaniv, Nature. 297 : 17-18 (1982) on enhancing elements for activation of eukaryotic promot- 
ers. The enhancer may be spliced into the vector at a position 5' or 3* to the target polypeptide DNA, but is preferably 
located at a site 5' from the promoter. 

(f) Transcription Termination Component 

25 

[0153] Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells 
from other multicellular organisms) will also contain sequences necessary for the termination of transcription and for 
stabilizing the mRNA. Such sequences are commonly available from the 5' and, occasionally 3' untranslated regions of 
eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated frag- 
30 ments in the untranslated portion of the mRNA encoding the target polypeptide. The 3' untranslated regions also 
include transcription termination sites. 

[01 54] Construction of suitable vectors containing one or more of the above listed components the desired coding and 
control sequences employs standard ligation techniques. Isolated plasmids or DNA fragments are cleaved, tailored, 
and religated in the form desired to generate the plasmids required. 

35 [0155] For analysis to confirm correct sequences in plasmids constructed, the ligation mixtures are used to transform 
E. coli K12 strain 294 (ATCC 31,446) and successful transformants selected by ampicillin or tetracycline resistance 
where appropriate. Plasmids from the transformants are prepared, analyzed by restriction endonuclease digestion, 
and/or sequenced by the method of Messing ef al., Nucleic Acids Res.. 9: 309 (1981) or by the method of Maxam et 
al., Methods in Enzvmoloav. 65: 499 (1980). 

40 [01 56] Particularly useful in the practice of this invention are expression vectors that provide for the transient expres- 
sion in mammalian cells of DNA encoding the target polypeptide. In general, transient expression involves the use of 
an expression vector that is able to replicate efficiently in a host cell, such that the host cell accumulates many copies 
of the expression vector and, in turn, synthesizes high levels of a desired polypeptide encoded by the expression vector. 
Transient expression systems, comprising a suitable expression vector and a host cell, allow for the convenient positive 

45 identification of polypeptides encoded by cloned DNAs, as well as for the rapid screening of such polypeptides for 
desired biological or physiological properties. Thus, transient expression systems are particularly useful in the invention 
for purposes of identifying analogs and variants of the target polypeptide that have target polypoptide-like activity. 
[0157] Other methods, vectors, and host cells suitable for adaptation to the synthesis of the target polypeptide in 
recombinant vertebrate cell culture are described in Gething et al., Nature. 293 : 620-625 [1981]; Mantei et al., Nature. 

so 281 : 40-46 [1 979]; Levinson et al. ; EP 1 1 7,060; and EP 1 1 7,058. A particularly useful plasmid for mammalian cell cul- 
ture expression of the target polypeptide is pRK5 (EP pub. no. 307,247) or pSVI6B. 

Selection and Transformation of Host Cells 

55 [0158] Suitable host cells for cloning or expressing the vectors herein are the prokaryote, yeast, or higher eukaryote 
cells described above. Suitable prokaryotes include eubacteria, such as Gram-negative or Gram-positive organisms, 
for example, E. coli, Bacilli such as B. subtillis, Pseudomonas species such as P. aeruginose, Salmonella typhimurium, 
or Serratia marcescans. One preferred E. coli cloning host is E. coli 294 (ATCC 31 , 446), although other strains such 
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as E. coli B, E. coli x 1 776 (ATCC 31 ,537), and E. coli W31 1 0 (ATCC 27,325) are suitable. These examples are illustra- 
tive rather than limiting. Preferably the host cell should secrete minimal amounts of proteolytic enzymes. Alternatively, 
in vitro methods of cloning, e.g. PCR or other nucleic acid polymerase reactions, are suitable. 

[0159] In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable hosts for target 

s polypeptide-encoding vectors. Saccharomyces cerevisiae, or common baker's yeast, is the most commonly used 
among lower eukaryotic host microorganisms. However, a number of other genera, species, and strains are commonly 
available and useful herein, such as Schizosaccharomyces pombe [Beach and Nurse, Nature. 290 : 140 (1981); EP 
139,383 published May 2, 1985], Kiuyveromyces hosts (U.S. 4,943,529) such as, e.g., K. lactis [Louvencourt etal.,1. 
Bacterid.. 737 (1983)], K fragilis, K. bulgaricus. K. thermotolerans, and K. marxianus, yarrowia [EP 402,226], Pichia 

10 pastotis [EP 183,070; Sreekrishna era/., J. Basic Microbiol.. 2fi: 265-278 (1988)], Candida, Trichoderma reesia [EP 
244,234], Neurospora crassa [Case etal., Proc. Natl. Acad. Sci. USA. 76: 5259-5263 (1979)], and filamentous fungi 
such as, e.g. Neurospora, Penicillium, Tolypocladium [WO 91/00357 published 10 January 1991], and Aspergillus 
hosts such as A. nidulans [Ballance et al., Biochem. Biophys. Res. Commun. . 112 : 284-289 (1983); Tilburn et al, 
Gene. 2£: 205-221 (1983); Yelton et a/., Proc. Natl. Acad. Sci. USA, fil: 1470-1474 (1984)] and A. niger [Kelly and 

is Hynes, EMBO J.. 4: 475-479 (1 985)]. 

[0160] Suitable host cells for the expression of glycosylated target polypeptide are derived from multicellular organ- 
isms. Such host cells are capable of complex processing and glycosylation activities. In principle, any higher eukaryotic 
cell culture is workable, whether from vertebrate or invertebrate culture. Examples of invertebrate cells include plant 
and insect cells. Numerous baculoviral strains and variants and corresponding permissive insect host cells from hosts 

20 such as Spodoptera frugiperda (caterpillar), Aedes aegypti (mosquito), Aedes albopictus (mosquito). Drosophila 
melanogaster (fruitfly), and Bombyx mori host cells have been identified. See, e.g., LucWow et at., Bio/Technoloay. 6: 
47-55 (1988); Miller etal., in Genetic Engineering . Setlow, J.K. etaL, eds., Vol. 8 (Plenum Publishing, 1986), pp. 277- 
279; and Maeda et al, Nature. 315 : 592-594 (1985). A variety of such viral strains are publicly available, e.g., the L-1 
variant of Autographa californica NPV and the Bm-5 strain of Bombyx mori NPV, and such viruses may be used as the 

25 virus herein according to the present invention, particularly for transfection of Spodoptera frugiperda cells. Plant cell 
cultures of cotton, corn, potato, soybean, petunia, tomato, and tobacco can be utilized as hosts. Typically, plant cells are 
transfected by incubation with certain strains of the bacterium Agrobacterium tumefaciens, which has been previously 
manipulated to contain the target polypeptide DNA. During incubation of the plant cell culture with A. tumefaciens, the 
DNA encoding target polypeptide is transferred to the plant cell host such that it is transfected, and will, under appropri- 

30 ate conditions, express the target polypeptide DNA. In addition, regulatory and signal sequences compatible with plant 
cells are available, such as the nopaline synthase promoter and polyadenylation signal sequences. Depicker er al., i 
Mol. AddI. Gen.. 1: 561 (1982). In addition, DNA segments isolated from the upstream region of the T-DNA 780 gene 
are capable of activating or increasing transcription levels of plant-expressible genes in recombinant DNA-containing 
plant tissue. See EP 321,196 published 21 June 1989. 

35 [0161] However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture (tissue 
culture) has become a routine procedure in recent years [Tissue Culture. Academic Press, Kruse and Patterson, editors 
(1973)]. Examples of useful mammalian host call lines are monkey kidney CV1 line transformed by SV40 (COS-7, 
ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham 
et al., J. Gen Virol.. 36: 59 [1977]); baby hamster kidney cells (BHK. ATCC CCL 10); Chinese hamster ovary cells/- 

40 DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. USA . Z£ 4216 [1980]); mouse Sertoli cells (TM4, Mather. EioL 
Reorod.. 23: 243-251 [1980]); monkey kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76. 
ATCC CRL-1587); human cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); 
buffalo rat liver calls (BRL 3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, 
HB 8065); mouse mammary tumor (MMT 060562. ATCC CCL51); TRI cells (Mather era/.. Annals N.Y Acad. Sci. . 383: 

45 44-68 [1982]); MRC 5 cells; FS4 cells; and a human hepatoma cell line (Hep G2). Preferred host cells are human 
embryonic kidney 293 and Chinese hamster ovary cells. 

[0162] Host cells are transfected and preferably transformed with the above-described expression or cloning vectors 
of this invention and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting 
transformants, or amplifying the genes encoding the desired sequences. 
so [01 63] Transfection refers to the taking up of an expression vector by a host cell whether or not any coding sequences 
are in fact expressed. Numerous methods of transfection are known to the ordinarily skilled artisan, for example, CaP0 4 
and electroporation. Successful transfection is generally recognized when any indication of the operation of this vector 
occurs within the host cell. 

[0164] Transformation means introducing DNA into an organism so that the DNA is replicable, either as an extrachro- 
55 mosomal element or by chromosomal integrant. Depending on the host cell used, transformation is done using stand- 
ard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described in section 
1 .82 of Sambrook etal., supra, is generally used for prokaryotes or other cells that contain substantial cell-wall barriers. 
Infection with Agrobocterium tumefaciens is used for transformation of certain plant cells, as described by Shaw et al., 
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Gen e. 23: 315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells without such cell walls, the cal- 
cium phosphate precipitation method described in sections 16.30-16.37 of Sambrook etal, supra, is preferred. General 
aspects of mammalian cell host system transformations have been described by Axel in U.S. 4,399,216 issued 16 
August 1983. Transformations into yeast are typically carried out according to the method of Van Solingen et al., i 
5 Bact.. 130 : 946 (1 977) and Hsiao et al. , Proc. Natl. Acid. Sci. (USA). 76: 3829 (1 979). However, other methods for intro- 
ducing DNA into cells such as by nuclear injection, electroporation, or protoplast fusion may also be used. 

Culturinq the Host Cells 

10 [0165] Prokaryotic cells used to produce the target polypeptide of this invention are cultured in suitable media as 
described generally in Sambrook et al., supra, 

[0166] The mammalian host cells used to produce the target polypeptide of this invention may be cultured in a variety 
of media. Commercially available media such as Ham's F10 (Sigma), Minimal Essential Medium ([MEM], Sigma), 
RPMM640 (Sigma), and Dulbecco's Modified Eagle's Medium ([DMEM], Sigma) are suitable for culturing the host 

is cells. In addition, any of the media described in Ham and Wallace, Meth. Enz. §8: 44 (1979), Barnes and Sato, Anal. 
Biochem.. 102 : 255 (1980), U.S. 4,767,704; 4,657,866; 4,927,762; or 4,560,655; WO 90/03430; WO 87/00195; U.S. 
Pat. Re. 30,985, may be used as culture media for the host cells. Any of these media may be supplemented as neces- 
sary with hormones and/or other growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such as 
sodium chloride, calcium, magnesium, and phosphate), buffers (such as HEPES], nucleosides (such as adenosine and 

20 thymidine), antibiotics (such as Gentamycin™ drug), trace elements (defined as inorganic compounds usually present 
at final concentrations in the micromolar range), and glucose or an equivalent energy source. Any other necessary sup- 
plements may also be included at appropriate concentrations that would be known to those skilled in the art. The culture 
conditions, such as temperature, pH, and the (ike, are those previously used with the host cell selected for expression, 
and will be apparent to the ordinarily skilled artisan. 

25 [0167] The host cells referred to in this disclosure encompass cells in in vitro culture as well as cells that are within a 
host animal. 

[0168] It is further envisioned that the target polypeptides of this invention may be produced by homologous recom- 
bination, or with recombinant production methods utilizing control elements introduced into cells already containing 
DNA encoding the target polypeptide currently in use in the field. For example, a powerful promoter/enhancer element, 
30 a suppressor, or an exogenous transcription modulatory element is inserted in the genome of the intended host cell in 
proximity and orientation sufficient to influence the transcription of DNA encoding the desired target polypeptide. The 
control element does not encode the target polypeptide of this invention, but the DNA is present in the host cell genome. 
One next screens for cells making the target polypeptide of this invention, or increased or decreased levels of expres- 
sion, as desired. 

35 

Detecting Gene Amplification/Expression 

[0169] Gene amplification and/or expression may be measured in a sample directly, for example, by conventional 
Southern blotting, northern blotting to quantitate the transcription of mRNA (Thomas, Proc. Nat. Acad. Sci. USA. 77: 

40 5201 -5205 [1980]), dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled probe, based on 
the sequences provided herein. Various labels may be employed, most commonly radioisotopes, particularly 32 P. How- 
ever, other techniques may also be employed, such as using biotin-modif ied nucleotides for introduction into a polynu- 
cleotide. The biotin then serves as the site for binding to avidin or antibodies, which may be labeled with a wide variety 
of labels, such as radionuclides, fluorescers, enzymes, or the like. Alternatively, antibodies may be employed that can 

45 recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein 
duplexes. The antibodies in turn may be labeled and the assay may be earned out where the duplex is bound to a sur- 
face, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. 
[0170] Gene expression, alternatively, may be measured by immunological methods, such as immunohistochemical 
staining of tissue sections and assay of cell culture or body fluids, to quantitate directly the expression of gene product. 

50 With immunohistochemical staining techniques, a cell sample is prepared, typically by dehydration and fixation, fol- 
lowed by reaction with labeled antibodies specific for the gene product coupled, where the labels are usually visually 
detectable, such as enzymatic labels, fluorescent labels, luminescent labels, and the like. A particularly sensitive stain- 
ing technique suitable for use in the present invention is described by Hsu et al., Am. J. Clin. Path. . 75: 734-738 (1980). 
[0171] Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either monoclonal 

55 or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be prepared against a native tar- 
get polypeptide or against a synthetic peptide based on the DNA sequences provided herein as described further in 
Section 4 below. 
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Purification of The Target polypeptide 

[0172] The target polypeptide preferably is recovered from the culture medium as a secreted polypeptide, although it 
also may be recovered from host cell lysates when directly expressed without a secretory signal. 

s [0173] When the target polypeptide is expressed in a recombinant cell other than one of human origin, the target 
polypeptide is completely free of proteins or polypeptides of human origin. However, it is necessary to purify the target 
polypeptide from recombinant cell proteins or polypeptides to obtain preparations that are substantially homogeneous 
as to the target polypeptide. As a first step, the culture medium or lysate is centrifuged to remove particulate cell debris. 
The membrane and soluble protein fractions are then separated. The target polypeptide may then be purified from the 

w soluble protein fraction and from the membrane fraction of the culture lysate, depending on whether the target polypep- 
tide is membrane bound. The following procedures are exemplary of suitable purification procedures: fractionation on 
immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a 
cation exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration 
using, for example, Sephadex G-75; and protein A Sepharose columns to remove contaminants such as IgG. 

is [0174] Target polypeptide variants in which residues have been deleted, inserted or substituted are recovered in the 
same fashion, taking account of any substantial changes in properties occasioned by the variation. For example, prep- 
aration of a target polypeptide fusion with another protein or polypeptide, e.g. a bacterial or viral antigen, facilitates puri- 
fication; an immunoaffinity column containing antibody to the antigen (or containing antigen, where the target 
polypeptide is an antibody) can be used to adsorb the fusion. Immunoaffinity columns such as a rabbit polyclonal anti- 

20 target polypeptide column can be employed to absorb the target polypeptide variant by binding it to at least one remain- 
ing immune epitope. A protease inhibitor such as phenyl methyl sulfonyl fluoride (PMSF) also may be useful to inhibit 
proteolytic degradation during purification, and antibiotics may be included to prevent the growth of adventitious con- 
taminants. One skilled in the art will appreciate that purification methods suitable for native target polypeptide may 
require modification to account for changes in the character of the target polypeptide or its variants upon expression in 

25 recombinant cell culture. 

Covalent Modifications of Target Polypeptides 

[0175] Covalent modifications of target polypeptides are included within the scope of this invention. One type of cov- 
30 alent modification included within the scope of this invention is a target polypeptide fragment. Target polypeptide frag- 
ments having up to about 40 amino acid residues may be conveniently prepared by chemical synthesis, or by enzymatic 
or chemical cleavage of the full-length target polypeptide or variant target polypeptide. Other types of covalent modifi- 
cations of the target polypeptide or fragments thereof are introduced into the molecule by reacting specific amino acid 
residues of the target polypeptide or fragments thereof with an organic derivatizing agent that is capable of reacting with 
35 selected side chains or the N- or C-terminal residues. 

[0176] Cysteinyl residues most commonly are reacted with a-haloacetates (and corresponding amines), such as chlo- 
roacetic acid or chloroacetamide. to give carboxymethyl or carboxyamidomethyl derivatives. Cysteinyl residues also are 
derivatized by reaction with bromotrifluoroacetone, a-bromo-p-(5-imidozoyl)propionic acid, chloroacetyl phosphate, N- 
alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide, p-chloromercuribenzoate, 2-chloromercuri-4- 
40 nitrophenol, or chloro-7-nitrobenzo-2-oxa-1 ,3-diazole. 

[0177] Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0 because this agent is rel- 
atively specific for the histidyl side chain. Para-bromophenacyl bromide also is useful; the reaction is preferably per- 
formed in 0.1 M sodium cacodylate at pH 6.0. 

[0178] Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid anhydrides. Derivatiza- 
45 tion with these agents has the effect of reversing the charge of the lysinyl residues. Other suitable reagents for deriva- 
tizing a-amino-containing residues include imidoesters such as methyl picolinimidate; pyridoxal phosphate; pyricloxal; 
chloroborohydride; trinitrobenzenesulfonic acid; O-methylisourea; 2,4-pentanedione; and transaminase-catalyzed 
reaction with glyoxylate. 

[0179] Arginyl residues are modified by reaction with one or several conventional reagents, among them phenylgly- 
so oxal, 2,3-butanedione, 1 ,2-cyclohexanedione, and ninhydrin. Derivatization of arginine residues requires that the reac- 
tion be performed in alkaline conditions because of the high pK a of the guanidine functional group. Furthermore, these 
reagents may react with the groups of lysine as well as the arginine epsilon-amino group. 

[0180] The specific modification of tyrosyl residues may be made, with particular interest in introducing spectral labels 
into tyrosyl residues by reaction with aromatic diazonium compounds or tetranitromethane. Most commonly, N-acetyli- 
55 midizole and tetranitromethane are used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively. Tyrosyl 
residues are iodinated using 125 l or 131 l to prepare labeled proteins for use in radioimmunoassay, the chloramine T 
method described above being suitable. 

[0181] Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with carbodiimides (FT-N = C 
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= N-R'), where R and R' are different alkyl groups, such as 1-cyclohexyl-3-[2-morpholinyl-4-ethyl) carbodiimide or 1- 
ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are converted to 
asparaginyl and glutaminyl residues by reaction with ammonium ions. 

[0182] Derivatization with Afunctional agents is useful for crosslinking target polypeptide to a water-insoluble support 
s matrix or surface for use in the method for purifying anti-target polypeptide antibodies, and vice versa. Commonly used 
crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, 
for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl eaters such as 
3,3'-dithiobis(succinimidylpropionate), and Afunctional maleimides such as bis-N-maleimido-1,8-octane. Derivatizing 
agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate yield photoactivatable intermediates that are capable of 
10 forming crosslinks in the presence of light. Alternatively, reactive water-insoluble matrices such as cyanogen bromide- 
activated carbohydrates and the reactive substrates described in U.S. 3,969,287; 3,691,016; 4,195,128; 4,247,642; 
4,229,537; and 4,330,440 are employed for protein immobilization. 

[0183] Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl and aspartyl 
residues, respectively. Alternatively, these residues are deamidated under mildly acidic conditions. Either form of these 

is residues falls within the scope of this invention. 

[0184] Other modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or 
threonyl residues, methylation of the a-amino groups of lysine, arginine, and histidine side chains (T.E. Creighton, Pro- 
teins: Structure and Molecular Properties. W.H. Freeman & Co., San Francisco, pp. 79-86 [1983]), acetylation of the N- 
terminal amine, and amidation of any C-terminal carboxyl group. 

20 [0185] Another type of covalent modification of the target polypeptide included within the scope of this invention com- 
prises altering the native glycosylation pattern of the polypeptide. By altering is meant deleting one or more carbohy- 
drate moieties found in the native target polypeptide, and/or adding one or more glycosylation sites that are not present 
in the native target polypeptide. 

[0186] Glycosylation of polypeptides is typically either N-linked or O-linked. N-linked refers to the attachment of the 
25 carbohydrate moiety to the side chain of an asparagine residue. The tri-peptide sequences asparagine-X-serine and 
asparagine-X-threonine, where X is any amino acid except proline, are the recognition sequences for enzymatic attach- 
ment of the carbohydrate moiety to the asparagine side chain. Thus, the presence of either of these tripeptide 
sequences in a polypeptide creates a potential glycosylation site. O-linked glycosylation refers to the attachment of one 
of the sugars N-acetylgalactosamine, galactose, or xylose, to a hydroxyamino acid, most commonly serine or threonine, 
30 although 5-hydroxyproline or 5-hydroxylysine may also be used. 

[0187] Addition of glycosylation sites to the target polypeptide is conveniently accomplished by altering the amino acid 
sequence such that it contains one or more of the above-described tri-peptide sequences (for N-linked glycosylation 
sites). The alteration may also be made by the addition of, or substitution by, one or more serine or threonine residues 
to the native target polypeptide sequence (for O-linked glycosylation sites). For ease, the target polypeptide amino acid 
35 sequence is preferably altered through changes at the DNA level, particularly by mutating the DN A encoding the target 
polypeptide at preselected bases such that codons are generated that will translate into the desired amino acids. The 
DNA mutation(s) may be made using methods described above under the heading of "Amino Acid Sequence Variants 
of Target Polypeptide". 

[0188] Another means of increasing the number of carbohydrate moieties on the target polypeptide is by chemical or 
40 enzymatic coupling of glycosides to the polypeptide. These procedures are advantageous in that they do not require 
production of the polypeptide in a host cell that has glycosylation capabilities for N- and O- linked glycosylation. 
Depending on the coupling mode used, the sugar(s) may be attached to (a) arginine and histidine, (b) free carboxyl 
groups, (c) free sulfhydryl groups such as those of cysteine, (d) free hydroxyl groups such as those of serine, threonine, 
or hydroxyproline, (e) aromatic residues such as those of phenylalanine, tyrosine, or tryptophan, or (f) the amide group 
45 of glutamine. These methods are described in WO 87/05330 published 1 1 September 1987, and in Aplin and Wriston 
(CTC CEIL Pev, PiQChem T , pp. 259-306 [1981]). 

[0189] Removal of carbohydrate moieties present on the native target polypeptide may be accomplished chemically 
or enzymatically. Chemical deglycosylation requires exposure of the polypeptide to the compound trf luoromethanesul- 
fonic acid, or an equivalent compound. This treatment results in the cleavage of most or all sugars except the linking 

so sugar (N-acetylglucosamine or N-acetylgalactosamine), while leaving the polypeptide intact. Chemical deglycosylation 
is described by Hakimuddin et aL (Arch. Biochem. Biophvs. . 25^:52 [1987]) and by Edge et aL (Anal. Biochem. . 
113:131 [1981]). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety 
of endo- and exoglycosidases as described by Thotakura et aL (Meth. Enzymol.. 138:350 [1987]). 
[0190] Glycosylation at potential glycosylation sites may be prevented by the use of the compound tunicemycin as 

55 described by Duskin et aL (J. Biol. Chem.. 257:3105 [1982]). Tunicamycin blocks the formation of protein-N-glycoside 
linkages. 

[01 91 ] Another type of covalent modification of the target polypeptide comprises linking the target polypeptide to var- 
ious nonproteinaceous polymers, e.g. polyethylene glycol, polypropylene glycol or polyoxyalkylenes, in the manner set 
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forth in U.S. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

[0192] The target polypeptide also may be entrapped in microcapsules prepared, for example, by coacervation tech- 
niques or by interfacial polymerization (for example, hydroxymethylcellulose or gelatin-microcapsules and poly-[methyl- 
methacylate] microcapsules, respectively), in colloidal drug delivery systems (for example, liposomes, albumin 
5 microspheres, microemulsions, nano-particles and nanocapsules), or in macroemulsions. Such techniques are dis- 
closed in Remington's Pharmaceutical Sciences. 16th edition, Osol, A., Ed., (1980). 

[0193] Target polypeptide preparations are also useful in generating antibodies, for screening for binding partners, as 
standards in assays for the target polypeptide (e.g. by labeling the target polypeptide for use as a standard in a radio- 
immunoassay, enzyme-linked immunoassay, or radioreceptor assay), in affinity purification techniques, and in compet- 

10 itive-type receptor binding assays when labeled with radioiodine, enzymes, fluorophores, spin labels, and the like. 
[01 94] Since it is often difficult to predict in advance the characteristics of a variant target polypeptide, it will be appre- 
ciated that some screening of the recovered variant will be needed to select the optimal variant. For example, a change 
in the immunological character of the target polypeptide molecule, such as affinity for a given antigen or antibody, is 
measured by a competitive-type immunoassay. The variant is assayed for changes in the suppression or enhancement 

75 of its activity by comparison to the activity observed for the target polypeptide in the same assay. Other potential mod- 
ifications of protein or polypeptide properties such as redox or thermal stability, hydrophobicity, susceptibility to proteo- 
lytic degradation, stability in recombinant cell culture or in plasma, or the tendency to aggregate with carriers or into 
multimers are assayed by methods well known in the art. 

20 Diagnostic and Related Uses of the Antibodies 

[0195] The antibodies of this invention are useful in diagnostic assays for antigen expression in specific cells or tis- 
sues. The antibodies are detectably labeled and/or are immobilized on an insoluble matrix. 

[0196] The antibodies of this invention find further use for the affinity purification of the antigen from recombinant cell 
25 culture or natural sources. Suitable diagnostic essays for the antigen and its antibodies depend on the particular antigen 
or antibody. Generally, such assays include competitive and sandwich essays, and steric inhibition essays. Competitive 
and sandwich methods employ a phase-separation step as an integral part of the method while steric inhibition essays 
are conducted in a single reaction mixture. Fundamentally, the same procedures are used for the assay of the antigen 
and for substances that bind the antigen, although certain methods will be favored depending upon the molecular 
30 weight of the substance being assayed. Therefore, the substance to be tested is referred to herein as an analyte, irre- 
spective of its status otherwise as an antigen or antibody, and proteins that bind to the analyte are denominated binding 
partners, whether they be antibodies, cell surface receptors, or antigens. 

[0197] Analytical methods for the antigen or its antibodies all use one or more of the following reagents: labeled ana- 
lyte analogue, immobilized analyte analogue, labeled binding partner, immobilized binding partner and steric conju- 

35 gates. The labeled reagents also are known as "tracers." 

[0198] The label used (and this is also useful to label antigen nucleic acid for use as a probe) is any detectable func- 
tionality that does not interfere with the binding of analyte and its binding partner. Numerous labels are known for use 
in immunoassay, examples including moieties that may be detected directly, such as f luorochrome, chemiluminescent, 
and radioactive labels, as well as moieties, such as enzymes, that must be reacted or derivatized to be detected. Exam- 

40 pies of such labels include the radioisotopes 32 P, 14 C, 125 l 3 H, and 131 1 fluorophores such as rare earth chelates or flu- 
orescein and its derivatives, rhodamine and its derivatives, dansyl, umbel liferone, luceriferases, e.g., firefly luciferase 
and bacterial luciferase (U.S. Pat. No. 4,737,456), luciferin, 2,3<lihydrophthalazinediones, horseradish peroxidase 
(HRP), alkaline phosphatase, p-galactosidase, glucoamylase, lysozyme, saccharide oxidases, e.g., glucose oxidase, 
galactose oxidase, and glucose-6-phosphate dehydrogenase, heterocyclic oxidases such as uricase and xanthine oxi- 

45 dase, coupled with an enzyme that employs hydrogen peroxide to oxidize a dye precursor such as HRP, lactoperoxi- 
dase, or microperoxidase, biotin/avidin, spin labels, bacteriophage labels, stable free radicals, and the like. 
[0199] Conventional methods are available to bind these labels covalently to proteins or polypeptides. For instance, 
coupling agents such as dialdehydes, carbodiimides, dimaleimides, bis-imidates, bis-diazotized benzidine, and the like 
may be used to tag the antibodies with the above-described fluorescent, chemiluminescent, and enzyme labels. See, 

50 for example, U.S. Pat. Nos. 3,940,475 (fluorimetry) and 3,645,090 (enzymes); Hunter et a/., Nature. 144: 945 (1962); 
David era/., Biochemistry. 13: 1014-1021 (1974); Pain etai, J. Immunol. Methods . 40: 219-230 (1981); and Nygren, i 
Histochem. and Cytochem.. 2Q: 407-412 (1982). Preferred labels herein are enzymes such as horseradish peroxidase 
and alkaline phosphatase. The conjugation of such label, including the enzymes, to the antibody is a standard manip- 
ulative procedure for one of ordinary skill in immunoassay techniques. See, for example, O'Sullivan et a!., "Methods for 

ss the Preparation of Enzyme-antibody Conjugates for Use In Enzyme Immunoassay," in Methods in Enzvmoloov. ed. J.J. 
Langone and H. Van Vunakis, Vol. 73 (Academic Press, New York, New York, 1981), pp. 147-166. Such bonding meth- 
ods are suitable for use with the antibodies and polypeptides of this invention. 

[0200] Immobilization of reagents is required for certain assay methods. Immobilization entails separating the binding 
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partner from any analyte that remains free in solution. This conventionally is accomplished by either insolubilizing the 
binding partner or analyte analogue before the assay procedure, as by adsorption to a water-insoluble matrix or surface 
(Bennich etal., U.S. 3.720,760), by covalent coupling (for example, using glutaraldehyde cross-linking), or by insolubi- 
lizing the partner or analogue afterward, e.g., by immunoprecipitation. 

s [0201 ] Other assay methods, Known as competitive or sandwich assays, are well established and widely used in the 
commercial diagnostics industry. 

[0202] Competitive assays rely on the ability of a tracer analogue to compete with the test sample analyte for a limited 
number of binding sites on a common binding partner. The binding partner generally is insolubilized before or after the 
competition and then the tracer and analyte bound to the binding partner are separated from the unbound tracer and 

10 analyte. This separation is accomplished by decanting (where the binding partner was preinsolubilized) or by centrifug- 
ing (where the binding partner was precipitated after the competitive reaction). The amount of test sample analyte is 
inversely proportional to the amount of bound tracer as measured by the amount of marker substance. Dose-response 
curves with known amounts of analyte are prepared and compared with the test results to quantitatively determine the 
amount of analyte present in the test sample. These assays are called ELISA systems when enzymes are used as the 

75 detectable markers. 

[0203] Another species of competitive assay, called a "homogeneous" assay, does not require a phase separation. 
Here, a conjugate of an enzyme with the analyte is prepared and used such that when anti-analyte binds to the analyte 
the presence of the anti-analyte modifies the enzyme activity. In this case, the antigen or its immunologically active frag- 
ments are conjugated with a Afunctional organic bridge to an enzyme such as peroxidase. Conjugates are selected for 
20 use with antibody so that binding of the antibody inhibits or potentiates the enzyme activity of the label. This method per 
se is widely practiced under the name of EMIT 

[0204] Steric conjugates are used in steric hindrance methods for homogeneous assay. These conjugates are syn- 
thesized by covalently linking a low-molecular-weight hapten to a small analyte so that antibody to hapten substantially 
is unable to bind the conjugate at the same time as anti-analyte. Under this assay procedure the analyte present in the 

25 test sample will bind anti-analyte, thereby allowing anti-hapten to bind the conjugate, resulting in a change in the char- 
acter of the conjugate hapten, e.g., a change in fluorescence when the hapten is a fluorophore. 
[0205] Sandwich assays particularly are useful for the determination of antigen or antibodies. In sequential sandwich 
assays an immobilized binding partner is used to adsorb test sample analyte, the test sample is removed as by wash- 
ing, the bound analyte is used to adsorb labeled binding partner, and bound material is then separated from residual 

30 tracer. The amount of bound tracer is directly proportional to test sample analyte. In "simultaneous" sandwich assays 
the test sample is not separated before adding the labeled binding partner. A sequential sandwich assay using an anti- 
antigen monoclonal antibody as one antibody and a polyclonal anti-antigen antibody as the other is useful in testing 
samples for particular antigen activity. 

[0206] The foregoing are merely exemplary diagnostic assays for the import and humanized antibodies of this inven- 
35 tion. Other methods now or hereafter developed for the determination of these analytes are included within the scope 
hereof, including the bioassays described above. 

Immunotoxins 

40 [0207] This invention is also directed to immunochemical derivatives of the antibodies of this invention such as immu- 
notoxins (conjugates of the antibody and a cytotoxic moiety). Antibodies which carry the appropriate effector functions, 
such as with their constant domains, are also used to induce lysis through the natural complement process, and to inter- 
act with antibody dependent cytotoxic cells normally present. 

[0208] For example, purified, sterile filtered antibodies are optionally conjugated to a cytotoxin such as ricin for use in 
45 AIDS therapy. US Patent Application Serial No. 07/350,895 illustrates methods for making and using immunotoxins for 
the treatment of HIV infection. The methods of this invention, for example, are suitable for obtaining humanized antibod- 
ies for use as immunotoxins for use in AIDS therapy. 

[0209] The cytotoxic moiety of the immunotoxin may be a cytotoxic drug or an enzymatically active toxin of bacterial, 
fungal, plant or animal origin, or an enzymatically active fragment of such a toxin. Enzymatically active toxins and frag- 

50 ments thereof used are diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dian- 
thin proteins, Phytolaca americans proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, 
sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin and the tricothecenes. In 
another embodiment, the antibodies are conjugated to small molecule anticancer drugs such as cisplatin or 5FU. Con- 

55 jugates of the monoclonal antibody and such cytotoxic moieties are made using a variety of Afunctional protein cou- 
pling agents. Examples of such reagents are SPDP, IT , Afunctional derivatives of imidoesters such as dimethyl 
adipimidate HCI, active esters such as disuccinimidyl suberate, aldehydes such as glutaraldehyde, bis-azido com- 
pounds such as bis (p-azidobenzoyl) hexanediamine, bis-diazonium derivatives such as bis- (p-diazoniumbenzoyl)- - 



30 



EP 0 940 468 A1 

ethylenediamine, diisocyanates such as tolylene 2,6-diisocyanate and bis-active fluorine compounds such as 1,5-dif- 
luoro- 2,4-dinitrobenzene. The lysing portion of a toxin may be joined to the Fab fragment of the antibodies. 
[021 0J Immunotoxins can be made in a variety of ways, as discussed herein. Commonly known crosslinking reagents 
can be used to yield stable conjugates. 
s [021 1 ] Advantageously, monoclonal antibodies specifically binding the domain of the antigen which is exposed on the 
infected cell surface, are conjugated to ricin A chain. Most advantageously the ricin A chain is deglycosylated and pro- 
duced through recombinant means. An advantageous method of making the ricin immunotoxin is described in Vitetta 
era/., Science 238:1098 (1987). 

[0212] When used to kill infected human cells in vitro for diagnostic purposes, the conjugates will typically be added 
w to the cell culture medium at a concentration of at least about 1 0 nM. The formulation and mode of administration for in 
vitro use are not critical. Aqueous formulations that are compatible with the culture or perfusion medium will normally 
be used. Cytotoxicity may be read by conventional techniques. 

[0213] Cytotoxic radiopharmaceuticals for treating infected cells may be made by conjugating radioactive isotopes 
(e.g. I, Y, Pr) to the antibodies. Advantageously alpha particle-emitting isotopes are used. The term 'cytotoxic moiety" 

is as used herein is intended to include such isotopes. 

[0214] In a preferred embodiment, ricin A chain is deglycosylated or produced without oligosaccharides, to decrease 
its clearance by irrelevant clearance mechanisms (e.g., the liver). In another embodiment, whole ricin (A chain plus B 
chain) is conjugated to antibody if the galactose binding property of B-chain can be blocked ("blocked ricin"). 
[021 5] In a further embodiment toxin-conjugates are made with Fab or Ffab^ fragments. Because of their relatively 

20 small size these fragments can better penetrate tissue to reach infected cells. 

[021 6] In another embodiment, fusogenic liposomes are filled with a cytotoxic drug and the liposomes are coated with 
antibodies specifically binding the particular antigen. 

Antibody Dependent Cellular Cytotoxicity 

25 

[0217] Certain aspects of this invention involve antibodies which are (a) directed against a particular antigen and (b) 
belong to a subclass or isotype that is capable of mediating the lysis of ceils to which the antibody molecule binds. More 
specifically, these antibodies should belong to a subclass or isotype that, upon complexing with cell surface proteins, 
activates serum complement and/or mediates antibody dependent cellular cytotoxicity (ADCC) by activating effector 

30 cells such as natural killer cells or macrophages. 

[0218] Biological activity of antibodies is known to be determined, to a large extent, by the constant domains or Fc 
region of the antibody molecule (Uananue and Benacerraf, Textbook of Immunology, 2nd Edition, Williams & Wilkins, 
p. 218 (1984)). This includes their ability to activate complement and to mediate antibody-dependent cellular cytotoxic- 
ity (ADCC) as effected by leukocytes. Antibodies of different classes and subclasses differ in this respect, as do arrti- 

35 bodies from the same subclass but different species; according to the present invention, antibodies of those classes 
having the desired biological activity are prepared. Preparation of these antibodies involves the selection of antibody 
constant domains are their incorporation in the humanized antibody by known technique. For example, mouse immu- 
noglobulins of the lgG3 and lgG2a class are capable of activating serum complement upon binding to the target cells 
which express the cognate antigen, and therefore humanized antibodies which incorporate igG3 and lgG2a effector 

40 functions are desirable for certain therapeutic applications. 

[0219] In general, mouse antibodies of the lgG2a and lgG3 subclass and occasionally lgG1 can mediate ADCC, and 
antibodies of the lgG3, lgG2a, and IgM subclasses bind and activate serum complement. Complement activation gen- 
erally requires the binding of at least two IgG molecules in close proximity on the target cell. However, the binding of 
only one IgM molecule activates serum complement. 

45 [0220] The ability of any particular antibody to mediate lysis of the target cell by complement activation and/or ADCC 
can be assayed. The cells of interest are grown and labeled in vitro; the antibody is added to the cell culture in combi- 
nation with either serum complement or immune cells which may be activated by the antigen antibody complexes. 
Cytolysis of the target cells is detected by the release of label from the lysed cells. In fact, antibodies can be screened 
using the patient's own serum as a source of complement and/or immune cells. The antibody that is capable of activat- 

so ing complement or mediating ADCC in the in vitro test can then be used therapeutically in that particular patient. 

[0221] This invention specifically encompasses consensus Fc antibody domains prepared and used according to the 
teachings of this invention. 

Therapeutic and Other Uses of the Antibodies 

55 

[0222] When used in vivo for therapy, the antibodies of the subject invention are administered to the patient in thera- 
peutically effective amounts (i.e. amounts that have desired therapeutic effect). They will normally be administered 
parenteraily. The dose and dosage regimen will depend upon the degree of the infection, the characteristics of the par- 
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ticular antibody or immunotoxin used, e.g., its therapeutic index, the patient, and the patient's history. Advantageously 
the antibody or immunotoxin is administered continuously over a period of 1-2 weeks, intravenously to treat cells in the 
vasculature and subcutaneously and intraperitoneal ly to treat regional lymph nodes. Optionally, the administration is 
made during the course of adjunct therapy such as combined cycles of radiation, chemotherapeutic treatment, or 
5 administration of tumor necrosis factor, interferon or other cytoprotective or immunomodulatory agent. 

[0223] For parenteral administration the antibodies will be formulated in a unit dosage injectable form (solution, sus- 
pension, emulsion) in association with a pharmaceutical^ acceptable parenteral vehicle. Such vehicles are inherently 
nontoxic, and non-therapeutic. Examples of such vehicles are water, saline, Ringer's solution, dextrose solution, and 
5% human serum albumin. Nonaqueous vehicles such as fixed oils and ethyl oleate can also be used. Liposomes may 
io be used as carriers. The vehicle may contain minor amounts of additives such as substances that enhance isotonicity 
and chemical stability, e.g., buffers and preservatives. The antibodies will typically be formulated in such vehicles at 
concentrations of about 1 mg/ml to 10 mg/ml. 

[0224] Use of IgM antibodies may be preferred for certain applications, however IgG molecules by being smaller may 
be more able than IgM molecules to localize to certain types of infected cells. 

75 [0225] TTiere is evidence that complement activation in vivo leads to a variety of biological effects, including the induc- 
tion of an inflammatory response and the activation of macrophages (Uananue and Benecerraf, Textbook of 
Immunology, 2nd Edition, Williams & Wilkins, p. 218 (1984)). The increased vasodilation accompanying inflammation 
may increase the ability of various agents to localize in infected cells. Therefore, antigen-antibody combinations of the 
type specified by this invention can be used therapeutically in many ways. Additionally, purified antigens (Hakomori, 

20 Ann. Rev. Immunol. 2:103 (1984)) or anti-idiotypic antibodies (Nepom et at., Proc. Natl. Acad. Sci. 81:2864 (1985); 
KoprowsW et a/., Proc Natl. Acad. Sci. 81:216 (1984)) relating to such antigens could be used to induce an active 
immune response in human patients. Such a response includes the formation of antibodies capable of activating human 
complement and mediating ADCC and by such mechanisms cause infected cell destruction. 

[0226] Optionally, the antibodies of this invention are useful in passively immunizing patients, as exemplified by the 

25 administration of humanized anti-HIV antibodies. 

[0227] The antibody compositions used in therapy are formulated and dosages established in a fashion consistent 
with good medical practice taking into account the disorder to be treated, the condition of the individual patient, the site 
of delivery of the composition, the method of administration and other factors known to practitioners. The antibody com- 
positions are prepared for administration according to the description of preparation of polypeptides for administration, 

30 infra. 

Deposit of Materials 

[0228] As described above, cultures of the muMAb4D5 have been deposited with the American Type Culture Collec- 

35 tion, 12301 ParWawn Drive, Rockvilie, MD, USA (ATCC). 

[0229] This deposit was made under the provisions of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest Treaty). 
This assures maintenance of viable cultures for 30 years from the date of the deposit. The organisms will be made avail- 
able by ATCC under the terms of the Budapest Treaty, and subject to an agreement between Genentech, Inc. and 

40 ATCC, which assures permanent and unrestricted availability of the progeny of the cultures to the public upon issuance 
of the pertinent U.S. patent or upon laying open to the public of any U.S. or foreign patent application, whichever comes 
first, and assures availability of the progeny to one determined by the U.S. Commissioner of Patents and Trademarks 
to be entitled thereto according to 35 USC § 122 and the Commissioner's rules pursuant thereto (including 37 CFR 
§1.12 with particular reference to 886 OG 638). 

45 [0230] In respect of those designations in which a European patent is sought, a sample of the deposited microorgan- 
ism will be made available until the publication of the mention of the grant of the European patent or until the date on 
which the application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample 
to an expert nominated by the person requesting the sample. (Rule 28(4) EPC) 

[0231] The assignee of the present application has agreed that if the cultures on deposit should die or be lost or 
so destroyed when cultivated under suitable conditions, they will be promptly replaced on notification with a viable speci- 
men of the same culture. Availability of the deposited strain is not to be construed as a license to practice the invention 
in contravention of the rights granted under the authority of any government in accordance with its patent laws. 
[0232] The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the 
invention. The present invention is not to be limited in scope by the constructs deposited, since the deposited embodi- 
55 ments are intended to illustrate only certain aspects of the invention and any constructs that are functionally equivalent 
are within the scope of this invention. The deposit of material herein does not constitute an admission that the written 
description herein contained is inadequate to enable the practice of any aspect of the invention, including the best mode 
thereof, nor is it to be construed as limiting the scope of the claims to the specific illustrations that they represent. 
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Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to 
those skilled in the art from the foregoing description and fall within the scope of the appended claims. 
[0233] It is understood that the application of the teachings of the present invention to a specific problem or situation 
will be within the capabilities of one having* ordinary skill in the art in light of the teachings contained herein. Examples 
5 of the products of the present invention and representative processes for their isolation, use, and manufacture appear 
below, but should not be construed to limit the invention. 

EXAMPLES 

10 EXAMPLE 1. HUMANIZATION OF muMAb4D5 

[0234] Here we report the chimerization of muM Ab4D5 (chMAb4D5) and the rapid and simultaneous humanization 
of heavy (V H ) and light (VJ chain variable region genes using a novel "gene conversion mutagenesis' 1 strategy. Eight 
humanized variants (huMAb4D5) were constructed to probe the importance of several FR residues identified by our 
rs molecular modeling or previously proposed to be critical to the conformation of particular CDRs (see Chothia, C. & 
Lesk, A. M., J. Mol. Biol. 196:901-917 (1987); Chothia, C. et al., Nature 342:877-883 (1989); Tramontane A. et a/., J. 
Mol. Biol. 215:175-182 (1990)). Efficient transient expression of humanized variants in non-myeloma cells allowed us 
to rapidly investigate the relationship between binding affinity for p185 HER2 ECD and anti -proliferative activity against 
p185 HER2 overexpressing carcinoma cells. 

20 

MATERIALS and METHODS 

[0235] Cloning of Variable Region Genes. The muMAb4D5 V H and V L genes were isolated by polymerase chain 
reaction (PCR) amplification of mRNA from the corresponding hybridoma (Fendly, B. M. et a/., Cancer Res. 50:1550- 

25 1 558 (1 990)) as described by Orlandi et al. (Orlandi, R. et ai, Proc. Natl. Acad. Sci. USA 86:3833-3837 (1 989)). Amino 
terminal sequencing of muMAb4D5 V L and V H was used to design the sense strand PCR primers, whereas the anti- 
sense PCR primers were based upon consensus sequences of murine framework residues (Orlandi, R. et al., Proc. 
Natl. Acad. Sci. USA 86:3833-3837 (1989); Kabat, E. A. et al., Sequences of Proteins of Immunological Interest 
(National Institutes of Health, Bethesda, MD, 1987)) incorporating restriction sites for directional cloning shown by 

30 underlining and listed after the sequences: V L sense, S'-TCCOfflA^CAGCTGACCCAGTCTCCA-S' (SEQ. ID NO. 7), 
EcoRV; V L anti-sense, 5'-GTTTGATCTCCAGCTTGGTACCHSCDCCGAA-3' (SEQ. ID NO. 8), >Asp718; V H sense, 5'- 
AGGTSM ARCTGCAGSAGTCWGG-3' (SEQ. ID NO. 9), Pst\ and V H anti-sense, 5'-TGAGGAGAC GGTGACC GT- 
GGTCCCTTGGCCCCAG-3' (SEQ. ID NO. 10), BstEU; where H = A or C or T, S = C or G, D = A or G or T, M = A or C, 
R = A or G and W = A or T The PCR products were cloned into pUC1 19 (Vieira, J. & Messing, J., Methods Enzymol. 

35 1 53:3-1 1 (1 987)) and five clones for each variable domain sequenced by the dideoxy method (Sanger, F. et al., Proc. 
Natl. Acad. Sci. USA 74:5463-5467 (1977)). 

[0236] Molecular Modelling. Models for muMAb4D5 V H and V L domains were constructed separately from consen- 
sus coordinates based upon seven Fab structures from the Brookhaven protein data bank (entries 1 FB4, 2RHE, 2MCP, 
3 FAB, 1 FBJ, 2HFL and 1 REI). The Fab fragment KOL (Marquart, M. et ai, J. Mol. Biol. 141 :369-391 (1980)) was first 

40 chosen as a template for V L and V H domains and additional structures were then superimposed upon this structure 
using their main chain atom coordinates (INSIGHT program, Biosym Technologies). The distance from the template Ca 
to the analogous Ca in each of the superimposed structures was calculated for each residue position. If all (or nearly 
all) Ca-Ca distances for a given residue were 1 A, then that position was included in the consensus structure. In most 
cases the p-sheet framework residues satisfied these criteria whereas the CDR loops did not. For each of these 

45 selected residues the average coordinates for individual N, Ca, C, O and Cp atoms were calculated and then corrected 
for resultant deviations from non-standard bond geometry by 50 cycles of energy minimization using the DISCOVER 
program (Biosym Technologies) with the AMBER forcefield (Weiner, S. J. et al., J. Amer. Chem. Soc. 106:765-784 
(1984)) and Ca coordinates fixed. The side chains of highly conserved residues, such as the disulf ide-bridged cysteine 
residues, were then incorporated into the resultant consensus structure. Next the sequences of muMAb4D5 V L and Vh 

so were incorporated starting with the CDR residues and using the tabulations of CDR conformations from Chothia et al. 
(Chothia, C. et ai, Nature 342:877-883 (1 989)) as a guide. Side-chain conformations were chosen on the basis of Fab 
crystal structures, rotamer libraries (Ponder, J. W. & Richards, F. M., J. Mol. Biol. 193:775-791 (1987)) and packing con- 
siderations. Since V H -CDR3 could not be assigned a definite backbone conformation from these criteria, two models 
were created from a search of similar sized loops using the INSIGHT program. A third model was derived using packing 

55 and solvent exposure considerations. Each model was then subjected to 5000 cycles of energy minimization. 

[0237] In humanizing muMAb4D5, consensus human sequences were first derived from the most abundant sub- 
classes in the sequence compilation of Kabat et al. (Kabat, E. A. et al., Sequences of Proteins of Immunological Interest 
(National Institutes of Health, Bethesda, MD, 1987)), namely V L k subgroup I and V H group 111, and a molecular model 
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generated for these sequences using the methods described above. A structure for huMAb4D5 was created by trans- 
ferring the CDRs from the muMAb4D5 model into the consensus human structure. All huMAb4D5 variants contain 
human replacements of muM Ab4D5 residues at three positions within CDRs as defined by sequence variability (Kabat, 
E. A. et aL, Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD, 1987)) but 
5 not as defined by structural variability (Chothia, C. & Lesk, A. M., J. Moi. Biol. 196:901-917 (1987)): V L -CDR1 K24R, 
V L -CDR2 R54L and V L -CDR2 T56S. Differences between muMAb4D5 and the human consensus framework residues 
(Fig. 1) were individually modeled to investigate their possible influence on CDR conformation and/or binding to the 
p185 HER2 ECD. 

[0238] Construction of Chimeric Genes. Genes encoding chMAb4D5 light and heavy chains were separately 

io assembled in previously described phagemid vectors containing the human cytomegalovirus enhancer and promoter, 
a 5' intron and SV40 polyadenylation signal (Gorman, C. M. et aL, DNA & Prot. Engin. Tech. 2:3-10 (1990)). Briefly, 
gene segments encoding muMAb4D5 V L (Fig. 1 A) and REI human k-j light chain Cl (Palm, W. & Hilschmann, N., Z 
Physiol. Chem. 356:167-191 (1975)) were precisely joined as were genes for muMAMDS V H (Fig. 1B) and human y\ 
constant region (Capon, D. J. et aL, Nature 337:525-531 (1989)) by simple subcloning (Boyle, A., in Current Protocols 

15 in Molecular Biology, Chapter 3 (F. A. Ausubel et aL, eds., Greene Publishing & Wiley- Interscience, New York, 1990)) 
and site-directed mutagenesis (Carter, P., in Mutagenesis: A Practical Approach, Chapter 1 (IRL Press, Oxford, UK 
1991)). The y1 isotype was chosen as it has been found to be the preferred human isotype for supporting ADCC and 
complement dependent cytotoxicity using matched sets of chimeric (Bruggemann, M. et aL, J. Exp. Med. 166:1351- 
1361 (1987)) or humanized antibodies (Riechmann, L. etaL, Nature 332:323-327 (1988)). The PCR-generated V L and 

20 V H fragments (Fig. 1) were subsequently mutagenized so that they faithfully represent the sequence of muMAb4D5 
determined at the protein level: V H Q1 E, V L V104L and T109A (variants are denoted by the amino acid residue and 
number followed by the replacement amino acid). The human /1 constant regions are identical to those reported by Elli- 
son et al. (Ellison, J. W. et aL, Nucleic Acids Res. 13:4071-4079 (1982)) except for the mutations E359D and M361L 
(Eu numbering, as in Kabat, E. A. etaL, Sequences of Proteins of Immunological Interest (National Institutes of Health, 

25 Bethesda, MD, 1987)) which we installed to convert the antibody from the naturally rare A allotype to the much more 
common non-A allotype (Tramontano, A. et al., J. Mol. Biol. 215:175-182 (1990)). This was an attempt to reduce the 
risk of anti-allotype antibodies interfering with therapy. 

[0239] Construction of Humanized Genes. Genes encoding chMAb4D5 light chain and heavy chain Fd fragment 
(V H and C H 1 domains) were subcloned together into pUC119 (Vieira, J. & Messing, J., Methods Enzymol. 153:3-11 

30 (1987)) to create pAK1 and simultaneously humanized in a single step (Fig. 2). Briefly, sets of 6 contiguous oligonucle- 
otides were designed to humanize V H and V L (Fig. 1 ). These oligonucleotides are 28 to 83 nucleotides in length, contain 
zero to 19 mismatches to the murine antibody template and are constrained to have 8 or 9 perfectly matched residues 
at each end to promote efficient annealing and ligation of adjacent oligonucleotides. The sets of V H and V L humaniza- 
tion oligonucleotides (5 pmol each) were phosphorylated with either ATP or y- 32 P-ATP (Carter, P. Methods Enzymol. 

35 1 54:382-403 (1 987)) and separately annealed with 3.7 pmol of pAK1 template in 40 nl 1 0 mM Tris-HCI (pH 8.0) and 1 0 
mM MgCI 2 by cooling from 100 °C to room temperature over - 30 min. The annealed oligonucleotides were joined by 
incubation with T4 DNA ligase (12 units; New England Biolabs) in the presence of 2 *il 5 mM ATP and 2 ^l 0.1 M DTT 
for 10 min at 14 °C. After electrophoresis on a 6% acrylamide sequencing gel the assembled oligonucleotides were 
located by autoradiography and recovered by electroelution. The assembled oligonucleotides (-0.3 pmol each) were 

40 simultaneously annealed to 0.15 pmol single-stranded deoxyuridine-containing pAK1 prepared according to Kunkel et 
al. (Kunkel, T. A. etaL, Methods Enzymol. 154:367-382 (1987)) in 10 jil 40 mM Tris-HCI (pH 7.5) and 16 mM MgCI 2 as 
above. Heteroduplex DNA was constructed by extending the primers with T7 DNA polymerase and transformed into E. 
coli BMH 71-18 mutt as previously described (Carter, P., in Mutagenesis: A Practical Approach, Chapter 1 (IRL Press, 
Oxford, UK 1991)). The resultant phagemid DNA pool was enriched first for huV L by restriction purification using Xho\ 

45 and then for huV H by restriction selection using Stu\ as described in Carter, P., in Mutagenesis: A Practical Approach, 
Chapter 1 (IRL Press, Oxford, UK 1991); and in Wells, J. A. etaL, Phil. Trans. R. Soc. Lond. A 317:415-423 (1986). 
Resultant clones containing both huV L and huV H genes were identified by nucleotide sequencing (Sanger, F. et aL, 
Proc. Natl. Acad. ScL USA 74:5463-5467 (1 977)) and designated pAK2. Additional humanized variants were generated 
by site-directed mutagenesis (Carter, R, in Mutagenesis: A Practical Approach, Chapter 1 (IRL Press, Oxford, UK 

so 1 991 )). The muM Ab4D5 V L and V H Gene segments in the transient expression vectors described above were then pre- 
cisely replaced with their humanized versions. 

[0240] Expression and Purification of MAb4D5 Variants. Appropriate MAb4D5 light and heavy chain cDNA expression 
vectors were co-transfected into an adenovirus transformed human embryonic kidney cell line, 293 (Graham, F. L. et 
aL, J. Gen. Virol. 36:59-72 (1977)) using a high efficiency procedure (Gorman, C. M. et aL, DNA & Prot. Engin. Tech. 
55 2:3-10 (1990); Gorman, C, in DNA Cloning, vol II, pp 143-190 (D. M. Glover, ed., IRL Press, Oxford, UK 1985)). Media 
were harvested daily for up to 5 days and the cells re-fed with serum free media. Antibodies were recovered from the 
media and affinity purified on protein A sepharose CL-4B (Pharmacia) as described by the manufacturer. The eluted 
antibody was buffer-exchanged into phosphate-buffered saline by G25 gel filtration, concentrated by ultrafiltration (Cen- 
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triprep-30 or Centricon-100, Amicon), sterile-filtered (Millex-GV, Millipore) and stored at 4 °C. The concentration of anti- 
body was determined by using both total immunoglobulin and antigen binding ELlSAs. The standard used was 
huMAb4D5-5, whose concentration had been determined by amino acid composition analysis. 

[0241] Cell Proliferation Assay. The effect of MAb4D5 variants upon proliferation of the human mammary adeno- 
5 carcinoma cell line, SK-BR-3, was investigated as previously described (Fondly, B. M. etal., Cancer Res. 50:1550-1558 
(1990)) using saturating MAb4D5 concentrations. 

[0242] Affinity Measurements. The antigen binding affinity of MAb4D5 variants was determined using a secreted 
form of the p185 HER2 ECD prepared as described in Fondly, B. M. etal., J. Biol. Resp. Mod 9:449-455 (1990). Briefly, 
antibody and p185 HER2 ECD were incubated in solution until equilibrium was found to be reached. The concentration 
io of free antibody was then determined by ELISA using immobilized p185 HER2 ECD and used to calculate affinity (K d ) 
according to Friguet et al. (Friguet, B. etal., J. Immunol. Methods 77:305-319 (1985)). 

RESULTS 

75 [0243] Humanization of muMAb4D5. The muMAb4D5 V L and V H gene segments were first cloned by PCR and 
sequenced (Fig. 1). The variable genes were then simultaneously humanized by gene conversion mutagenesis using 
preassembled oligonucleotides (Fig. 2). A 31 1-mer oligonucleotide containing 39 mismatches' to the template directed 
24 simultaneous amino acid changes required to humanize muMAb4D5 V L . Humanization of muMAb4D5 V H required 
32 amino acid changes which were installed with a 361 -mer containing 59 mismatches to the muMAb4D5 template. 

20 Two out of 8 clones sequenced precisely encode huMAb4D5-5, although one of mess clones contained a single nucle- 
otide imperfection. The 6 other clones were essentially humanized but contained a small number of errors: < 3 nucle- 
otide changes and < 1 single nucleotide deletion per kilobase. Additional humanized variants (Table 3) were 
constructed by site-directed mutagenesis of huMAb4D5-5. 

[0244] Expression levels of huMAb4D5 variants were in the range of 7 to 15 \ig/rr\\ as judged by ELISA using immo- 

25 bilized p185 HER2 ECD. Successive harvests of five 10 cm plates allowed 200 \ig to 500 mg of each variant to be pro- 
duced in a week. Antibodies affinity purified on protein A gave a single band on a Coomassie blue stained SDS 
polyacrylamide gel of mobility consistent with the expected M r of ~ 150 kDa. Electrophoresis under reducing conditions 
gave 2 bands consistent with the expected M x of free heavy (48 kDa) and light (23 kDa) chains (not shown). Amino ter- 
minal sequence analysis (10-cycles) gave the mixed sequence expected (see Fig. 1) from an equimolar combination of 

30 light and heavy chains (not shown). 

[0245] huMAb4D5 Variants. In general, the FR residues were chosen from consensus human sequences (Kabat, E. 
A. et al., Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD, 1987)) and 
CDR residues from muMAb4D5. Additional variants were constructed by replacing selected human residues in 
huMAb4D5-1 with their muMAb4D5 counterparts. These are V H residues 71, 73, 78, 93 plus 102 and V L residues 55 

35 plus 66 identified by our molecular modeling. V H residue 71 has previously been proposed by others (Tramontano. A. 
etal., J. Mol. Biol 215:175-182 (1990)) to be critical to the conformation of V H -CD R2. Amino acid sequence differences 
between huMAb4D5 variant molecules are shown in Table 3, together with their p185 HER2 ECD binding affinity and 
maximal antiproliferative activities against SK-BR-3 cells. Very similar K d values were obtained for binding of MAb4D5 
variants to either SK-BR-3 cells or to p185 HER2 ECD (Table 3). However, K d estimates derived from binding of MAb4D5 

40 variants to p1 85 HER2 ECD were more reproducible with smaller standard errors and consumed much smaller quantities 
of antibody than binding measurements with whole cells. 

[0246] The most potent humanized variant designed by molecular modeling, huMAb4D5-8, contains 5 FR residues 
from muMAb4D5. This antibody binds the p185 HER2 ECD 3-fold more tightly than does muMAb4D5 itself (Table 3) and 
has comparable anti-proliferative activity with SK-BR-3 cells (Fig. 3). In contrast, huMAb4D5-1 is the most humanized 

45 but least potent muMAb4D5 variant, created by simply installing the muMAb4D5 CDRs into the consensus human 
sequences. huMAb4D5-1 binds the p185 HER2 ECD 80-fold less tightly than does the murine antibody and has no 
detectable anti-proliferative activity at the highest antibody concentration investigated (18 jig/ml). 
[0247] The anti-proliferative activity of huMAb4D5 variants against p185 HER2 overexpressing SK-BR-3 cells is not 
simply correlated with their binding affinity for the p1 85 HER2 ECD. For example, installation of three murine residues into 

so the V H domain of huMAb4D5-2 (D73T, L78A and A93S) to create huMAb4D5-3 does not change the antigen binding 
affinity but does confer significant anti-proliferative activity (Table 3). 

[0248] The importance of V H residue 71 (Tramontano, A. etal., J. Mol. Biol. 215:175-182 (1990)) is supported by the 
observed 5-fold increase in affinity for p185 HER2 ECD on replacement of R71 in huMAb4D5-1 with the corresponding 
murine residue, alanine (huMAb4D5-2). In contrast, replacing V H L78 in huMAb4D5-4 with the murine residue, alanine 
55 (huMAb4D5-5), does not significantly change the affinity for the p185 HER2 ECD or change anti-proliferative activity, sug- 
gesting that residue 78 is not of critical functional significance to huMAb4D5 and its ability to interact property with the 
extracellular domain of p185 HER2 

[0249] V L residue 66 is usually a glycine in human and murine k chain sequences (Kabat, E. A. et al., Sequences of 
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Proteins of Immunological Interest (National Institutes of Health, Bethesda, MD, 1987)) but an arginine occupies this 
position in the muMAb4D5 k light chain. The side chain of residue 66 is likely to affect the conformation of V L -CDR1 and 
V L -CDR2 and the hairpin turn at 68-69 (Fig. 4). Consistent with the importance of this residue, the mutation V L Q66R 
(huMAb4D5-3 -> huMAb4D5-5) increases the affinity for the p185 HER2 ECD by 4-fold with a concomitant increase in 
5 antiproliferative activity. 

[0250] From molecular modeling it appears that the tyrosyl side chain of muM Ab4D5 V L residue 55 may either stabi- 
lize the conformation of V H -CDR3 or provide an interaction at the V L -V H interface. The latter function may be dependent 
upon the presence of V H Y102. In the context of huMAb4D5-5 the mutations V L E55Y (huMAb4D5-6) and V H V102Y 
(huMAb4D5-7) individually increase the affinity for p185 HER2 ECD by 5-fold and 2-fold respectively, whereas together 

w (huMAb4D5-8) they increase the affinity by 1 1 -fold. This is consistent with either proposed role of V L Y55 and V H Y1 02. 
[0251 ] Secondary Immune Function of huMAb4D5-8. MuMAb4D5 inhibits the growth of human breast tumor cells 
which overexpress p185 HER2 (Hudziak, R. M. et al., Molec. Cell. Biol. 9:1165-1172 (1989)). The antibody, however, 
does not offer the possibility of direct tumor cytotoxic effects. This possibility does arise in huMAb4D5-8 as a result of 
its high affinity (K d = 0.1 ^M) and its human lgG 1 subtype. Table 4 compares the ADCC mediated by huMAb4D5-8 with 

is muMAb4D5 on a normal lung epithelial cell line, WI-38, which expresses a low level of p185 HER2 and on SK-BR-3, 
which expresses a high level of p185 HER2 . The results demonstrate that: (1) huMAb4D5 has a greatly enhanced ability 
to carry out ADCC as compared with its murine parent; and (2) that this activity may be selective for cell types which 
overexpress p185 HER2 . 

20 DISCUSSION 

[0252] MuMAb4D5 is potentially useful for human therapy since it is cytostatic towards human breast, and ovarian 
tumor lines overexpressing the HEH2-encoded p185 HER2 receptor-like tyrosine kinase. Since both breast and ovarian 
carcinomas are chronic diseases it is anticipated that the optimal MAb4D5 variant molecule for therapy will have low 

25 immunogenicity and will be cytotoxic rather than solely cytostatic in effect. Humanization of muMAb4D5 should accom- 
plish these goals. We have identified 5 different huMAb4D5 variants which bind tightly to p185 HER2 ECD {K d z 1 nM) 
and which have significant anti -proliferative activity (Table 3). Furthermore huMAb4D5-8 but not muMAb4D5 mediates 
ADCC against human tumor cell lines overexpressing p185 HER2 in the presence of human effector cells (Table 4) as 
anticipated for a human y1 isotype (Bruggemann, M. et ai, J. Exp. Med. 166:1351-1361 (1987); Riechmann, L. etal., 

30 Nature 332:323-327 (1 988)). 

[0253] Rapid humanization of huMAb4D5 was facilitated by the gene conversion mutagenesis strategy developed 
here using long preassembled oligonucleotides. This method requires less than half the amount of synthetic DNA as 
does total gene synthesis and does not require convenient restriction sites in the target DNA. Our method appears to 
be simpler and more reliable than a variant protocol recently reported (Rostapshov, V. M. et al., FEBS Lett. 249:379- 

35 382 (1989)). Transient expression of huMAb4D5 in human embryonic kidney 293 cells permitted the isolation of a few 
hundred micrograms of huMAb4D5 variants for rapid characterization by growth inhibition and antigen binding affinity 
assays. Furthermore, different combinations of light and heavy chain were readily tested by co-transfection of corre- 
sponding cDNA expression vectors. 

[0254] The crucial role of molecular modeling in the humanization of muMAb4D5 is illustrated by the designed variant 

40 huMAb4D5-8 which binds the p185 HER2 ECD 250-fold more tightly than the simple CDR loop swap variant, huMAb4D5- 
1 . It has previously been shown that the antigen binding affinity of a humanized antibody can be increased by mutagen- 
esis based upon molecular modelling (Riechmann, L etal., Nature 332:323-327 (1986); Queen, C. et al, Proc. Natl. 
Acad. Sci. USA 86:10029-10033 (1989)). Here we have extended this earlier work by others with a designed human- 
ized antibody which binds its antigen 3-fold more tightly than the parent rodent antibody. While this result is gratifying, 

45 assessment of the success of the molecular modeling must await the outcome of X-ray structure determination. From 
analysis of huMAb4D5 variants (Table 3) it is apparent that their anti-proliferative activity is not a simple function of their 
binding affinity for p185 HER2 ECD. For example the huMAb4D5-8 variant binds p185 HER2 3-fold more tightly than 
muMAb4D5 but the humanized variant is slightly less potent in blocking the proliferation of SK-BR-3 cells. Additional 
huMAb4D5 variants are currently being constructed in an attempt to identify residues triggering the anti-proliferative 

so activity and in an attempt to enhance this activity. 

[0255] In addition to retaining tight receptor binding and the ability to inhibit cell growth, the huMAb4D5-8 also confers 
a secondary immune function (ADCC). This allows for direct cytotoxic activity of the humanized molecule in the pres- 
ence of human effector cells. The apparent selectivity of the cytotoxic activity for cell types which overexpress p185 HER2 
allows for the evolution of a straightforward clinic approach to those human cancers characterized by overexpression of 

55 the HER2 protooncogene. 
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Table 3. p1 85 HER2 ECD binding affinity and anti-prolrferative activities of MAb4D5 variants 



V„ Residue* 



V t Residue* 
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HAMD5 
cell 


71 


73 


78 


93 


102 


55 


66 




Relative 




Variant 


FR3 


FR3 


PR3 


FR3 


CDR3 


CDR2 


FR3 


nU 




15 


prol i f era£ i on* 


















• 




huMAb4D5-l 


R 


D 


L 


A 


V 


E 


G 


25 


102 


20 


huMAb4D5-2 
huMAb4D5 - 3 


Ala 
Ala 


D 
Thr 


L 
Ala 


A 

Ser 


V 
V 


B 
5 


G 
G 


102 
466 






huMAb4D5-4 


Ala 


Thr 


L 


Ser 


V 


E 


Arg 


0.B8 




25 


huMAb4D5-5 
huMAb4D5-6 


Ala 
Ala 


Thr 
Thr 


Ala 
Ala 


Ser 
Ser 


V 
V 


B 
Tyr 


Arg 
Arg 


148 

0.82 


9 




huMAb4D5 - 7 


Ala 


Thr 


Ala 


Ser 


Tyr 


E 


Arg 


0.52 




30 


huMAb4D5-8 


Ala 


Thr 


Ala 


Ser 


Tyx 


Tyr 


Arg 


o.se 






muMAb4D5 


Ala 


Thr 


Ala 


Ser 


Tyr 


Tyr 


Arg 


0.30 





35 



40 



45 



50 



* Human and murine residues are shown in one letter and three letter amino acid code 
respectively. 

1 Af d values for the p1 85 HER2 ECD were determined using the method of Friouet et al. (43) 
and the standard error of each estimate is s ± 10%. 

* Proliferation of SK-BR-3 cells incubated for 96 hr with MAb4D5 variants shown as a 
percentage of the untreated control as described (Hudziak, R. M. et al, t Molec. Cell. Biol. 
9: 1 1 65-1 1 72 (1 989)). Data represent the maximal anti-proliferative effect for each variant 
(see Fig. 3A) calculated as the mean of triplicate determinations at a MAb4D5 
concentration of 8 //g/ml. Data are ail taken from the same experiment with an estimated 
standard error of £ ± 15%. 
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Table 4 



15 



20 



25 



Selectivity of antibody dependent tumor cell cytotoxicity mediated by huMAb4D5-8 




WI-38* 


SK-BR-3 


Effector Target ratio f 


muMAb4D5 


huMAb4D5-8 


muMAMDS 


huMAb4D5-8 


A.* 


25:1 


<1.0 


9.3 


7.5 


40.6 




12.5:1 


<1.0 


11.1 


4.7 


36.8 




6.25:1 


<1.0 


8.9 


0.9 


35.2 




3.13:1 


<1.0 


8.5 


4.6 


19.6 


B. 


25:1 


<1.0 


3.1 


6.1 


33.4 




12.5:1 


<1.0 


1.7 


5.5 


26.2 




6.25:1 


1.3 


2.2 


2.0 


21.0 




3.13:1 


<1.0 


0.8 


2.4 


13.4 



* Sensitivity to ADCC of two human cell lines (WI-38, normal lung epithelium; and SK-BR-3, 
human breast tumor cell line) are compared. WI-38 expresses a low level of p185 HER2 (0.6 pg 
per ug cell protein) and SK-BR-3 expresses a high level of p1 85 HER2 (64 pg p1 85 HER2 per \ig cell 
protein), as determined by ELISA (Fondly etal, J. Biol Resp. Mod. 9:449-455 (1990)). 

+ ADCC assays were carried out as described in BrOggemann etal., J. Exp. Med. 166:1351-1361 
(1987). Effector to target ratios were of IL-2 activated human peripheral blood lymphocytes to 
either WI-38 fibroblasts or SK-BR-3 tumor cells in 96- well microliter plates for 4 hours at 37 °C. 
Values given represent percent specific cell lysis as determined by 51 Cr release. Estimated 
standard error in these quadruplicate determinations was £ ± 1 0%. 

* Monoclonal antibody concentrations used were 0.1 ^g/ml (A) and 0.1 ng/ml (B). 



30 

EXAMPLE 2. Schematic Method for Humanizing an Antibody Sequence 



[0256] This example illustrates one stepwise elaboration of the methods for creating a humanized sequence 
described above. It will be understood that not all of these steps are essential to the claimed invention, and that steps 
35 may be taken in different order. 



1 . ascertain a consensus human variable domain amino acid sequence and prepare from it a consensus structural 
model. 

2. prepare model of import (the non-human domain to be humanized) variable domain sequences and note struc- 
40 tural differences with respect to consensus human model. 

3. identify CDR sequences in human and in import, both by using Kabat (supra, 1 987) and crystal structure criteria. 
If there is any difference in CDR identity from the different criteria, use of crystal structure definition of the CDR, but 
retain the Kabat residues as important framework residues to import. 

4. substitute import CDR sequences for human CDR sequences to obtain initial "humanized" sequence. 

45 5. compare import non-CDR variable domain sequence to the humanized sequence and note divergences. 

6. Proceed through the following analysis for each amino acid residue where the import diverges from the human- 
ized. 

a. If the humanized residue represents a residue which is generally highly conserved across all species, use 
so the residue in the humanized sequence. If the residue is not conserved across all species, proceed with the 

analysis described in 6b. 

b. If the residue is not generally conserved across all species, ask if the residue is generally conserved in 
humans. 



55 i. If the residue is generally conserved in humans but the import residue differs, examine the structural 

models of the import and human sequences and determine if the import residue would be likely to affect 
the binding or biological activity of the CDRs by considering 1) could it bind antigen directly and 2) could it 
affect the conformation of the CDR. If the conclusion is that an affect on the CDRs is likely, substitute the 
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import residue. If the conclusion is that a CDR affect is unlikely, leave the humanized residue unchanged. 

ii. If the residue is also not generally conserved in humans, examine the structural models of the import 
and human sequences and determine if the import residue would be likely to affect the binding or biologi- 
cal activity of the CDRs be considering 1) could it bind antigen directly and 2) could it affect the conforma- 
5 tion of the CDR. If the conclusion is that an affect on the CDRs is likely, substitute the import residue. If the 

conclusion is that a CDR affect is unlikely, proceed to the next step. 

a) Examine the structural models of the import and human sequences and determine if the residue is 
exposed on the surface of the domain or is buried within. If the residue is exposed, use the residue in 
w the humanized sequence. If the residue is buried, proceed to the next step. 

(i) Examine the structural models of the import and human sequences and determine if the resi- 
due is likely to affect the V L - V H interface. Residues involved with the interface include: 34L, 36L, 
38L, 43L, 33L, 36L t 85L, 87L, 89L, 91 L, 96L, 98L, 35H, 37H, 39H, 43H, 45H, 47H, 60H, 91 H, 93H, 
15 95H, 100H, and 103H. If no effect is likely, use the residue in the humanized sequence. If some 

affect is likely, substitute the import residue. 

7. Search the import sequence, the consensus sequence and the humanized sequence for glycosylation sites out- 
side the CDRs, and determine if this glycosylation site is likely to have any affect on antigen binding and/or biolog- 
ic? ical activity. If no effect is likely, use the human sequence at that site; if some affect is likely, eliminate the 

glycosylation site or use the import sequence at that site. 

8. After completing the above analysis, determine the planned humanized sequence and prepare and test a sam- 
ple. If the sample does not bind well to the target antigen, examine the particular residues listed below, regardless 
of the question of residue identity between the import and humanized residues. 

25 

a. Examine particular peripheral (non-CDR) variable domain residues that may, due to their position, possibly 
interact directly with a macromolecular antigen, including the following residues (where the * indicates residues 
which have been found to interact with antigen based on crystal structures): 

30 i. Variable light domain: 36, 46, 49*. 63-70 

ii. Variable heavy domain: 2, 47*. 68, 70, 73-76. 

b. Examine particular variable domain residues which could interact with, or otherwise affect, the conformation 
of variable domain CDRs, including the following (not including CDR residues themselves, since it is assumed 

35 that, because the CDRs interact with one another, any residue in one CDR could potentially affect the confor- 

mation of another CDR residue) (L = LIGHT, H = HEAVY, residues appearing in bold are indicated to be struc- 
turally important according the Chothia et a/., Nature 342:877 (1989), and residues appearing in italic were 
altered during humanization by Queen et ai. (PDL), Proc. Natl. Acad. Sci. USA 86:10029 (1989) and Proc. 
Natl. Acad. Sci. USA 88:2869 (1991).): 

40 

I Variable light domain: 

a) CDR-1 (residues 24L-34L): 2L, 4L, 66L-69L, 71 L 

b) CDR-2 (residues 50L-56L): 35L, 46L, 47L, 48L, 49L, 58L, 62L, 64L-66L, 71 L, 73L 
45 c) CDR-3 (residues 89L-97L): 2L, 4L, 36L, 98L, 37H, 45H, 47H, 58H, 60H 

ii. Variable heavy domain: 

a) CDR-1 (residues 26H-35H): 2H, 4H, 24H, 36H, 71 H, 73H, 76H, 78H, 92H, 94H 
50 b) CDR-2 (residues 50H-55H): 49H, 69H, 69H, 71H, 73H, 78H 

c) CDR-3 (residues 95H-102H): examine all residues as possible interaction partners with this loop, 
because this loop varies in size and conformation much more than the other CDRs. 

9. If after step 8 the humanized variable domain still is lacking in desired binding, repeat step 8. In addition, re- 
55 investigate any buried residues which might affect the V L - V H interface (but which would not directly affect CDR 

conformation). Additionally, evaluate the accessibility of non-CDR residues to solvent. 
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EXAMPLE 3. Engineering a Humanized Bispecific F(abl2 Fragment 

[0257] This example demonstrates the construction of a humanized bispecific antibody (BsFfab^vl by separate E 
coli expression of each Fab' arm followed by directed chemical coupling in vitro. BsFfab^ v1 (anti-CD3 / anti- 

s p185 HER2 ) was demonstrated to retarget the cytotoxic activity of human CD3 + CTL in vitro against the human breast 
tumor cell line, SK-BR-3, which overexpresses the p185 HER2 product of the protooncogene HER2. This example dem- 
onstrates the minimalistic humanization strategy of installing as few murine residues as possible into a human antibody 
in order to recruit antigen-binding affinity and biological properties comparable to that of the murine parent antibody. 
This strategy proved very successful for the anti-p1 85 HER2 arm of BsF(ab') 2 v1 . In contrast BsFfab^ v1 binds to T cells 

10 via its anti-CD3 arm much less efficiently than does the chimeric BsF(ab*) 2 which contains the variable domains of the 
murine parent anti-CD3 antibody. Here we have constructed additional BsF(ab*) 2 fragments containing variant anti-CD3 
arms with selected murine residues restored in an attempt to improve antibody binding to T cells. One such variant, Bs 
F(ab')2 v9, was created by replacing six residues in the second hypervariable loop of the anti-CD3 heavy chain variable 
domain of BsF(ab*)3 v1 with their counterparts from the murine parent anti-CD3 antibody. BsF(ab') 2 v9 binds to T cells 

15 (Jurkat) much more efficiently than does BsF(ab") 2 v1 and almost as efficiently as the chimeric BsFfabV This improve- 
ment in the efficiency of T cell binding of the humanized BsF(ab') 2 is an important step in its development as a potential 
therapeutic agent for the treatment of p1 85 HER2 -overexpressing cancers. 

[0258] Bispecific antibodies (BsAbs) with specificities for tumor-associated antigens and surface markers on immune 
effector cells have proved effective for retargeting effector cells to kill tumor targets both in vitro and in vivo (reviewed 

20 by Fanger, M. W. et a/., Immunol. Today 10: 92-99 (1989); Fanger, M. W. era/., Immunol. Today 12: 51-54 (1991); and 
Nelson, H. ( Cancer Cells 3:163-172 (1991)). BsF(ab') 3 fragment? have often been used in preference to intact BsAbs 
in retargeted cellular cytotoxicty to avoid the risk of killing innocent bystander cells binding to the Fc region of the anti- 
body. An additional advantage of BsF(ab') 2 over intact BsAbs is that they are generally much simpler to prepare free of 
contaminating monospecific molecules (reviewed by Songsivilai, S. and Lachmann, P. J., Clin. Exp. Immunol. 79:315- 

25 321 (1990) and Nolan, O. and O'Kennedy, R., Biochim. Biophys. Acta 1040: 1-11 (1990)). 

[0259] BsFJab^ fragments are traditionally constructed by directed chemical coupling of Fab' fragments obtained by 
limited proteolysis plus mild reduction of the parent rodent monoclonal Ab (Brennan, M. et al., Science 229, 81-83 
(1985) ondGlennie, M. J. etal, J. Immunol. 139: 2367-2375 (1987)). One such BsFfab'k fragment (arrti-glioma asso- 
ciated antigen / anti-CD3) was found to have clinical efficacy in glioma patients (Nitta, T et af., Lancet 335: 368-371 

30 (1990) and another BsFfab^ (anti-indium chelate / anti-carcinoembryonic antigen) allowed clinical imaging of colorec- 
tal carcinoma (Stickney, D. R. et at., Antibody, Immunoconj. Radiopharm. 2:1-13 (1 989)). Future BsFfcb'fe destined for 
clinical applications are likely to be constructed from antibodies which are either human or at least "humanized" (Riech- 
mann, L. et al., Nature 332: 323-327 (1988) to reduce their immunogenicity (Hale, G. et al., Lancet i: 1394-1399 
(1988)). 

35 [0260] Recently a facile route to a fully humanized BsF(ab*) 2 fragment designed for tumor immunotherapy has been 
demonstrated (Shalaby. M. R. etal., J. Exp. Med. 175: 217-225 (1992)). This approach involves separate E. coli expres- 
sion of each Fab' arm followed by traditional directed chemical coupling in vitro to form the BsF(ab') 2 . One arm of the 
BsF(ab*) 2 was a humanized version (Carter, R et al., Proc. Natl. Acad. Sci. USA (1992a) and Carter, P., et al., 
Bio/Technology 10: 163-167 (1992b)) of the murine monoclonal Ab 4D5 which is directed against the p185 HER2 product 

40 of the protooncogene HER2 {c-erbB-2) (Fendly, B. M. etal.. Cancer Res. 50: 1550-1558 (1989)). The humanization of 
the antibody 4D5 is shown in Example 1 of this application. The second arm was a minimalistically humanized anti-CD3 
antibody (Shalaby et al. supra) which was created by installing the CDR loops from the variable domains of the murine 
parent monoclonal Ab UCHT1 (Beverley. P. C. L and Collard. R. E., Eur. J. Immunol. 11: 329-334 (1981)) into the 
humanized anti-p185 HER2 antibody. The BsFfab'k fragment containing the most potent humanized anti-CD3 variant 

45 (v1) was demonstrated by flow cytometry to bind specifically to a tumor target overexpressing p1 85 HER2 and to human 
peripheral blood mononuclear cells carrying CD3. In addition, Bs F(ab' ) 2 v1 enhanced the cytotoxic effects of activated 
human CTL 4-fold against SK-BR-3 tumor cells overexpressing p185 HER2 . The example descries efforts to improve the 
antigen binding affinity of the humanized anti-CD3 arm by the judicious recruitment of a small number of additional 
murine residues into the minimalistically humanized anti-CD3 variable domains. 

50 

MATERIALS AND METHODS 

Construction of mutations in the anti-CD3 variable region genes. 

55 [0261 ] The construction of genes encoding humanized anti-CD3 variant 1 (v1 ) variable light (VJ and heavy (V H ) chain 
domains in phagemid pUC1 19 has been described (Shalaby etal. supra). Additional anti-CD3 variants were generated 
using an efficient site-directed mutagenesis method (Carter, R, Mutagenesis: a practical approach, (M. d. McPherson, 
Ed.), Chapter 1, IRL Press, Oxford, UK (1991)) using mismatched oligonucleotides which either install or remove 
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unique restriction sites. Oligonucleotides used are listed below using lowercase to indicate the targeted mutations. Cor- 
responding coding changes are denoted by the starting amino acid in one letter code followed by the residue numbered 
according to Kabat, E. A. et al., Sequences of Proteins of Immunological Interest, 5 th edition, National Institutes of 
Health, Bethesda, MD, USA (1991), then the replacement amino acid and finally the identity of the anti-CD3 variant: 

5 

HX11, 5' GTAGATAAATCCtCtAACACAGCCTAtCTGCAAATG 3' (SEQ.ID. NO. 1 1) V H K75S, v6; 
HX12.5* GTAGATAAATCCAAAtCtACAGCCTAtCTGCAAATG 3' (SEQ.ID. NO. 12) V H N76S, v7; 
HX13, 5' GTAGATAAATCCtCttctACAGCCTAtCTGCAAATG 3' (SEQ.ID. NO. 13) V H K75S:N76S, v8; 
X14, 5' CTTATAAAGGTGTTtCcACCTATaaCcAgAaatTCAA GGatCGTTTCACgATAtcCGTAGATAAATCC 3' 
10 (SEQ.ID.NO. 14) V H T57S:A60N:D61Q:S62K:V63F:G65D, v9; 

LX6, 5' CTATACCTCCCGTCTgcatTCTGGAGTCCC 3' (SEQ.ID, NO. 15) V L E55H, v11. 

Oligonucleotides HX1 1 , HX12 and HX1 3 each remove a site for BspMI, whereas LX6 removes a site for Xhol and HX1 4 
installs a site for EcoRV (bold). Anti-CD3 variant v10 was constructed from v9 by site-directed mutagenesis using oli- 
15 gonucleotide HX13. Mutants were verified by dideoxy nucleotide sequencing (Sanger, F. et al., Proa Natl. Acad. Sci. 
USA 74: 5463-5467 (1977)). 

£ coli expression of Fab* fragments 

20 [0262] The expression plasmid, pAK19, for the co-secretion of light chain and heavy chain Fd* fragment of the most 
preferred humanized anti-pi 85 HER2 variant, HuMAb4D5-8, is described in Carter et al., 19923, supra. Briefly, the Fab* 
expression unit is bicistronic with both chains under the transcriptional control of the phofK promoter. Genes encoding 
humanized V L and V H domains are precisely fused on their 5' side to a gene segment encoding the heat-stable enter- 
otoxin II signal sequence and on their 3' side to human k-j C L and lgG1 C H 1 constant domain genes, respectively. The 

25 C H 1 gene is immediately followed by a sequence encoding the hinge sequence CysAlaAla and followed by a bacteri- 
ophage X t Q transcriptional terminator. Fab' expression plasmids for chimeric and humanized anti-CD3 variants (v1 to 
v4, Shalaby et al.,supra; v6 to v12, this study) were created from pAK19 by precisely replacing anti-p185 HER2 V L and 
V H gene segments with those encoding murine and corresponding humanized variants of the anti-CD3 antibody, 
respectively, by sub-cloning and site-directed mutsgenesis. The Fab' expression plasmid for the most potent humanized 

30 anti-CD3 variant identified in this study (v9) is designated pAK22. The anti-p1 85 HER2 Fab' fragment was secreted from 
£ coli K12 strain 25F2 containing plasmid pAK19 grown for 32 to 40 hr at 37°C in an aerated 10 liter fermentor. The 
final cell density was 1 20-1 50 OD^ and the titer of soluble and functional anti-p1 85 HER2 Fab' was 1 -2 g/liter as judged 
by antigen binding ELISA (Carter et al., 1992b, supra). Anti-CD3 Fab' variants were secreted from E. coli containing 
corresponding expression plasmids using very similar fermentation protocols. The highest expression titers of chimeric 

35 and humanized anti-CD3 variants were 200 mg/lrter and 700 mg/liter, respectively, as judged by total immunoglobulin 
ELISA. 

Construction ofBsF(ab') 2 fragments 

40 [0263] Fab' fragments were directly recovered from £ coli fermentation pastes in the free thiol form (Fab -SH) by affin- 
ity purification on Streptococcal protein G at pH 5 in the presence of EDTA (Carter et al. , 1992b supra). Thioether linked 
BsF(ab') 2 fragments (anti-p185 HER2 /anti-CD3) were constructed by the procedure of Glennie et al. supra with the fol- 
lowing modifications. Anti-p185 HER2 Fab'-SH in 100 mM Tris acetate, 5 mM EDTA (pH 5.0) was reacted with 0.1 vol of 
40 mM N,N'-1,2-phenylenedimalemide (o-PDM) in dimethyl formamide for ~ 1.5 hr at 20 °C. Excess o-PDM was 

45 removed by protein G purification of the Fab' maleimide derivative (Fab'-mal) followed by buffer exchange into 20 mM 
sodium acetate, 5 mM EDTA (pH 5.3) (coupling buffer) using centriprep-30 concentrators (Amicon). The total concen- 
tration of Fab' variants was estimated from the measured absorbance at 280 nm (HuMAb4D5-8 Fab' e°* 1% = 1.56, 
Carter et al., 1992b, supra). The free thiol content of Fab' preparations was estimated by reaction with 5, 5'-dithiobis(2- 
nitrobenzoic acid) as described by Creighton, T. E., Protein structure; a practical approach, (T. E. Creighton, Ed.), 

so Chapter 7, IRL Press, Oxford, UK (1990). Equimolar amounts of anti-p185 HER2 Fab'-mal (assuming quantitative reac- 
tion of Fab'-SH with o-PDM) and each anti-CD3 Fab'-SH variant were coupled together at a combined concentration of 
1 to 2.5 mg/ml in the coupling buffer for 14 to 48 hr at 4 °C. The coupling reaction was adjusted to 4 mM cysteine at pH 
7.0 and incubated for 15 min at 20 ° C to reduce any unwanted disulfide-l inked F(ab* ) 2 formed. These reduction condi- 
tions are sufficient to reduce inter-heavy chain disulfide bonds with virtually no reduction of the disulfide between light 

55 and heavy chains. Any free thiols generated were then blocked with 50 mM iodoacetamide. BsF(ab*) 2 was isolated from 
the coupling reaction by S100-HR (Pharmacial size exclusion chromatooraphy (2.5 cm x 100 cm) in the presence of 
PBS. The BsFfab^ samples were passed through a 0.2 mm filter flash frozen in liquid nitrogen and stored at -70°C. 
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Flow cytometric analysts of Flab')^binding to Jurkat cells 

[0264] The Jurkat human acute T cell leukemia cell line was purchased from the American Type Culture Collection 
(Rockville, MD) (ATCC TIB 152) and grown as recommended by the ATCC. Aliquots of 10 6 Jurkat cells were incubated 

5 with appropriate concentrations of BsF(ab'>2 (anti-p1 85 HER2 / anti-CD3 variant) or control mono-specific anti-p1 85 HER2 
Ffab 1 ^ in PBS plus 0.1% (w/v) bovine serum albumin and 10 mM sodium azide for 45 min at 4 °C. The cells were 
washed and then incubated with fluorescein-conjugated goat anti-human FJab^ (Organon Teknika, West Chester, PA) 
for 45 min at 4 °C. Cells were washed and analyzed on a FACScan® (Becton Dickinson and Co., Mountain View, CA). 
Cells (8 x 10 3 ) were acquired by list mode and gated by forward light scatter versus side light scatter excluding dead 

w cells and debris. 

RESULTS 

Design of humanized anti-CD3 variants 

15 

[0265] The most potent humanized anti-CD3 variant previously identified, v1 , differs from the murine parent antibody, 
UCHT1 at 19 out of 107 amino acid residues within V L and at 37 out of 122 positions within V H (Shalaby et al. f supra) 
1992). Here we recruited back additional murine residues into anti-CD3 v1 in an attempt to improve the binding affinity 
for CD3. The strategy chosen was a compromise between minimizing both the number of additional murine residues 

20 recruited and the number of anti-CD3 variants to be analyzed. We focused our attentions on a few CDR residues which 
were originally kept as human sequences in our minimalistic humanization regime. Thus human residues in V H CDR2 
of anti-CD3 v1 were replaced en bloc with their murine counterparts to give anti-CD3 v9: 
T57S:A60N:D61Q:S62K:V63F:G65D (Fig. 5). Similarly, the human residue E55 in V L CDR2 of anti-CD3 v1 was 
replaced with histidine from the murine anti-CD3 antibody to generate anti-CD3 v1 1. In addition, V H framework region 

25 (FR) residues 75 and 76 in anti-CD3 v1 were also replaced with their murine counterparts to create anti-CD3 v8: 
K75S:N76S. V H residues 75 and 76 are located in a loop close to V H CDR1 and CDR2 and therefore might influence 
antigen binding. Additional variants created by combining mutations at these three sites are described below. 
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Preparation of BsF(ab') 2 fragments 



[0266] Soluble and functional anti-p1 85 HER2 and anti-CD3 Fab* fragments were recovered directly from correspond- 
ing E. coli fermentation pastas with the single hinge cysteine predominantly in the free thiol form (75-100 % Fab'-SH) 
by affinity purification on Streptococcal protein G at pH 5 in the presence of EDTA (Carter et al., 1992b, supra). 
Thioether-linked BsF(ab') 2 fragments were then constructed by directed coupling using o-PDM as described by Glennie 

35 et al, supra. One arm was always the most potent humanized anti-p185 HER2 variant, HuMAb4D5-8 (Carter ef al., 
1992a, supra) and the other either a chimeric or humanized variant of the anti-CD3 antibody. Anti-p185 HER2 Fab'-SH 
was reacted with o-PDM to form the maleimide derivative (Fab -mal) and then coupled to the Fab'-SH for each anti-CD3 
variant. Ftab^ was then purified away from unreacted Fab' by size exclusion chromatography as shown for a represent- 
ative preparation (BsF(ab") 2 v8) in data not shown. The Ffab^ fragment represents - 1 54% of the total amount of anti- 

40 body fragments (by mass) as judged by integration of the chromatograph peaks. 

[0267] SDS-PAGE analysis of this BdF(ab*) 2 v8 preparation under non-reducing conditions gave one major band with 
the expected mobility (M t ~ 96 kD) as well as several very minor bands (data not shown). Amino-terminal sequence 
analysis of the major band after electroblotting on to polyvinylidene difluoride membrane Matsudaira, P., J. Biol. Chem. 
262: 10035-10038 (1987) gave the expected mixed sequence from a stoichiometric 1:1 mixture of light and heavy 

45 chains (V L /V H : D/E, l/V, Q,Q, M/L, T/V, Q/E, S/S) expected for BsFfabV The amino terminal region of both light chains 
are identical as are both heavy chains and correspond to consensus human FR sequences. We have previously dem- 
onstrated that F(ab*) 2 constructed by directed chemical coupling carry both anti-p185 HER2 and anti-CD3 antigen spe- 
cificities (Shalaby et al., supra). The level of contamination of the BsFfab^ with monospecific Flab* ) 2 is likely to be very 
low since mock coupling reactions with either anti-p185 HER2 Fab'-mal or anti-CD3 Fab'-SH alone did not yield detecta- 

50 ble quantities of Flab' ) 2 . Furthermore the coupling reaction was subjected to a mild reduction step followed by alkylation 
to remove trace amounts of disulfide-linked F(ab' ) 2 that might be present. SDS-PAGE of the purified F(ab' ) 2 under 
reducing conditions gave two major bands with electrophoretic mobility and amino terminal sequence anticipated for 
free light chain and thioether-linked heavy chain dimers. 

[0268] Scanning LASER densitometry of a o-PDM coupled F(ab) 2 preparation suggest that the minor species 
55 together represent ~ 10% of the protein. These minor contaminants were characterized by amino terminal sequence 
analysis and were tentatively identified on the basis of stoichiometry of light and heavy chain sequences and their elec- 
trophoretic mobility (data not shown). These data are consistent with the minor contaminants including imperfect F(ab' 
) 2 in which the disulfide bond between light and heavy chains is missing in one or both arms, trace amounts of Fab' and 
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heavy chain thioelher-linked to light chain. 

Binding of BsF(ab') 2 to Jurkat cells 

s [0269] Binding of BsFfab'fe containing different anti-CD3 variants to Jurkat cells (human acute T cell leukemia) was 
investigated by flow cytometry (data not shown). BsF(ab') 2 v9 binds much more efficiently to Jurkat cells than does our 
starting molecule, BsFfab'fe v1 , and almost as efficiently as the chimeric BsF(ab')-) . Installation of additional murine res- 
idues into anti-CD3 v9 to create vIO (V H K75S:N76S) and v12 (V H K75S:N76S plus V L E55N) did not further improve 
binding of corresponding BsFfab^ to Jurkat cells. Nor did recruitment of these murine residues into anti-CD3 v1 

w improve Jurkat binding: V H K75S (v6), V H N76S (v7), V H K75S:N76S (v8), V L E55H (v1 1) (not shown). BsF(ab') 2 v9 was 
chosen for future study since it is amongst the most efficient variants in binding to Jurkat cells and contains fewest 
murine residues in the humanized anti-CD3 arm. A monospecific anti-p1 85 HER2 F(ab' ) 2 did not show significant binding 
to Jurkat cells consistent with the interaction being mediated through the anti-CD3 arm. 

75 DISCUSSION 

[0270] A minimalistic strategy was chosen to humanize the anti-p185 HER2 (Carter et al, 1992a, supra) and anti-CD3 
arms (Shalaby et al., supra) of the BsF(ab') 2 in this study in an attempt to minimize the potential immunogenicity of the 
resulting humanized antibody in the clinic. Thus we tried to install the minimum number of murine CDR and FR residues 

20 into the context of consensus human variable domain sequences as required to recruit antigen-binding affinity and bio- 
logical properties comparable to the murine parent antibody. Molecular modeling was used firstly to predict the murine 
FR residues which might be important to antigen binding and secondly to predict the murine CDR residues that might 
not be required. A small number of humanized variants were then constructed to test these predictions. 
[0271 ] Our humanization strategy was very successful for the anti-p1 85 HER2 antibody where one out of eight human- 

25 ized variants (HuMAb4D5-8, lgG1) was identified that bound the p185 HER2 antigen ~ 3-fold more tightly than the parent 
murine antibody (Carter era/., 1992a, supra). HuMAb4D5-8 contains a total of five murine FR residues and nine murine 
CDR residues, including V H CDR2 residues 60-65, were discarded in favor of human counterparts. In contrast, 
BsFfab^ v1 containing the most potent humanized anti-CD3 variant out of four originally constructed (Shalaby et al., 
supra) binds J6 cells with an affinity (K 6 ) of 140 nM which is ~ 70-fold weaker than that of the corresponding chimeric 

30 BsFtab^. 

[0272] Here we have restored T cell binding of the humanized anti-CD3 close to that of the chimeric variant by replac- 
ing six human residues in V H CDR2 with their murine counterparts: T57S:A60N:D61Q:S62K:V63F:G65D (anti-CD3 v9, 
Fig. 5). It appears more likely that these murine residues enhance antigen binding indirectly by influencing the confor- 
mation of residues in the N-terminal part of V H CDR2 rather than by directly contacting antigen. Firstly, only N-terminal 

35 residues in V H CDR2 (50-58) have been found to contact antigen in one or more of eight crystallographic structures of 
antibody/antigen complexes (Kabat etai, supra; and Mian, I. S. era/., J. Mol. Biol. 217: 133-151 (1991). Fig. 5). Sec- 
ondly, molecular modeling suggests that residues in the C-terminal part of V H CDR2 are at least partially buried (Fig. 
5). BsFfab^ v9 binds to SK-BR-3 breast tumor cells with equal efficiency to BsF(ab') 2 v1 and chimeric BsF(ab*) 2 as 
anticipated since the anti-p185 HER2 arm is identical in all of these molecules (Shalaby ef al., supra, not shown). 

40 [0273] Our novel approach to the construction of BsF(ab*)3 fragments exploits an E. coli expression system which 
secretes humanized Fab' fragments at gram per liter titers and permits their direct recovery as Fab'-SH (Carter et al., 
1992b, supra). Traditional directed chemical coupling of Fab'-SH fragments is then used to form BsF(ab') 2 in vitro 
(Brennan et al.,supra; and Glennie ef al., supra). This route to Fab'-SH obviates problems which are inherent in their 
generation from intact antibodies: differences in susceptibility to proteolysis and nonspecific cleavage resulting in het- 

45 erogeneity, low yield as well as partial reduction that is not completely selective for the hinge disulfide bonds. The strat- 
egy of using E. co//-derived Fab'-SH containing a single hinge cysteine abolishes some sources of heterogeneity in 
BsFfab^ preparation such as intra-hinge disulfide formation and contamination with intact parent antibody whilst 
greatly diminishes others, eg. formation of F(ab' ) 3 fragments. 

[0274] BsF(ab*)2 fragments constructed here were thioether-linked as originally described by Glennie et al, supra 
so with future in vivo testing of these molecules in mind. Thioether bonds, unlike disulfide bonds, are not susceptible to 
cleavage by trace amounts of thiol, which led to the proposal that thioether-linked F(ab' ) 2 may be more stable than 
disulfide-linked F(ab* ) 2 in vivo (Glennie etai, supra). This hypothesis is supported by our preliminary pharmacokinetic 
experiments in normal mice which suggest that thioether-linked BsF(ab') 2 v1 has a 3-fbld longer plasma residence time 
than BsF(ab*) 2 v1 linked by a single disulfide bond. Disulfide and thioether-linked chimeric BsF(ab') 2 were found to be 
55 indistinguishable in their efficiency of cell binding and in their retargeting of CTL cytotoxicity, which suggests that o-PDM 
directed coupling does not compromise binding of the BsF(ab') 2 to either antigen (not shown). Nevertheless the nature 
of the linkage appears not to be critical since a disulfide-linked BsF(ab') 2 (murine anti-p185 HER2 / murine anti-CD3) was 
recently shown by others (Nishimura ef al., Int. J. Cancer 50: 800-804 (1992) to have potent anti-tumor activity in nude 



43 



EP 0 940 468 A1 



mice. Our previous study (Shalaby et ai, supra) together with this one and that of Nishimura, T. et ai, supra improve 
the potential for using BsFfab'k in targeted immunotherapy of p185 HER2 -over expressing cancers in humans. EXAM- 
PLE 4. Humanization of an anti-CD1 8 antibody 

[0275] A murine antibody directed against the leukocyte adhesion receptor p-chain (known as the H52 antibody) was 
humanized following the methods described above. Figures 6A and 6B provide amino acid sequence comparisons for 
the murine and humanized antibody light chains and heavy chains. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc. 
(ii) TITLE OP INVENTION: Iranunoglobulin Variants 
(iii) NUMBER OF SEQUENCES : 25 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

(D) STATE: California 

(B) COUNTRY: USA 
(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 5.25 inch, 360 Kb floppy disfc 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: patin (Genentech) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
<C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/715272 

(B) APPLICATION DATE: 14-JUN-1991 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Adler, Carolyn R. 

(B) REGISTRATION NUMBER: 32 , 324 

(C) REFERENCE /DOCKET NUMBER: 709P1 

fix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/225-2614 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 

(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 1 : 

m 

Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
15 10 15 

Gly Asp Arg Val Thr He Thr Cys Arg Ala Ser Gin Asp Val Asn 

20 25 30 

Thr Ala Val Ala Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lye 

35 40 45 

Leu Leu He Tyr Ser Ala Ser Phe Leu Glu Ser Gly Val Pro Ser 

50 55 60 

Arg Phe 'Ser Gly Ser Arg Ser Gly Thr Asp Phe Thr Leu Thr He 

65 70 75 

Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin 

80 85 90 

His Tyr Thr Thr Pro Pro Thr Phe Gly Gin Gly Thr Lys Val Glu 

95 100 105 

He Lys Arg Thr 

109 

(2) INFORMATION FOR SBQ ID TO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:2: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 
15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn He Lys 

20 25 30 

Asp Thr Tyr He His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 

35 40 45 

Glu Trp Val Ala Arg He Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr 

50 55 60 

Ala Asp Ser Val Lys Gly Arg Phe Thr He Ser Ala Asp Thr Ser 

65 70 75 

Lys Asn Thr Ala Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

80 85 90 
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Thr Ala Val Tyr Tyr Cys Ser Arg Trp Gly Qly Asp Gly Phe Tyr 

95 100 105 

Ala Met Asp Val Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 

110 115 120 



(2) INFORMATION FOR SSQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 109 amino acids 

(B) type : amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SSQ ID NO: 3: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
1 5 10 15 

Gly Asp Arg Val Thr lie Thr Cys Arg Ala Ser Gin Asp Val Ser 

20 25 30 

Ser Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Lye Ala Pro Lys 

35 40 45 

Leu Leu lie Tyr Ala Ala Ser Ser Leu Glu Ser Gly Val Pro Ser 

50 55 60 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie 

65 70 75 

Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin 

80 65 90 

Tyr Asn Ser Leu Pro Tyr Thr Phe Gly Gin Gly Thr Lys Val Glu 

95 100 105 

lie Lys Arg Thr 

109 

(2) INFORMATION FOR SBQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SBQ ID NO:4: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 
1 5 10 15 

r 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser 

20 25 30 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



Asp Tyr Ala Met Ser Trp Val Arg Gin Ala Pro Gly Lye Gly Leu 

35 40 45 

Glu Trp Val Ala Val lie Ser Glu Asn Gly Gly Tyr Thr Arg Tyr 

50 55 €0 

Ala Asp Ser Val Lye Gly Arg Phe Thr Zle Ser Ala Asp Thr Ser 

65 70 75 

Lys Asn Thr Ala Tyr Leu Gin Met Asn Ser Leu Arg. Ala Glu Asp 

80 85 90 

Thr Ala Val Tyr Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr 

95 100 105 

Ala Met Asp Val Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 

110 115 120 



(2) INFORMATION FOR SBQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 109 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Asp lie Val. Met Thr Gin Ser His Lys Phe Met Ser Thr Ser Val 
15 10 15 

Gly Asp Arg Val Ser lie Thr CyB Lys Ala Ser Gin Asp Val Asn 

20 25 30 

Thr Ala Val Ala Trp Tyr Gin Gin Lys Pro Gly His Ser Pro Lys 

35 40 45 

Leu Leu lie Tyr Ser Ala Ser Phe Arg Tyr Thr Gly Val Pro Asp 

50 55 60 

Arg Phe Thr Gly Asn Arg Ser Gly Thr Asp Phe Thr Phe Thr lie 

65 70 75 

Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Tyr Cys Gin Gin 

80 85 90 

Hi 8 Tyr Thr Thr Pro Pro Thr Phe Gly Gly Gly Thr Lys Leu Glu 

95 100 105 

lie Lys Arg Ala 

109 
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15 
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30 



35 



40 



45 



(2) INFORMATION FOR SBQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Glu Val Oln Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly 
1 5 10 15 

Ala Ser Leu Lye Leu Ser Cys Tfar Ala Ser Gly Phe Asn lie Lye 

20 25 30 

Asp Thr Tyx lie His Trp Val Lys Gin Arg Pro Glu Gin Gly Leu 

35 40 45 

Glu Trp lie Gly Arg lie Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr 

20 50 55 €0 

Asp Pro LyB Phe Gin Asp Lys Ala Thr lie Thr Ala Asp Thr Ser 

65 70 75 



Ser Asn Thr Ala Tyr Leu Gin Val Ser Arg Leu Thr Ser Glu Asp 

80 85 90 

Thr Ala Val Tyr Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr 

95 100 105 

Ala Met Asp Tyr Trp Gly Gin Gly Ala Ser Val Thr Val Ser Ser 

110 115 120 



(2) INFORMATION FOR SBQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 27 bases 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SBQ ID NO: 7: 



TCCGATATCC AGCTGACCCA GTCTCCA 27 



(2) INFORMATION FOR SBQ ID NO: 8: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 
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(C) STRAND EDNESS : single 
"(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SBQ ID NO: 8: 



GTTTGATCTC CAGCTTGGTA CCXXCXCCGA A 31 



(2) INFORMATION FOR SBQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:9: 



AGGTXXAXCT GCAGXAGTCX GG 22 



(2) INFORMATION FOR SBQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 10: 



TGAGGAGACG GTGACCGTGG TCCCTTGGCC CCAG 34 



(2) INFORMATION FOR SBQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 11: 



GTAGATAAAT CCTCTAACAC AGCCTATCTG CAAATG 36 
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(2) INFORMATION FOR SBQ ID NO: 12: 

m 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYFB: nucleic acid 

(C) STRANDBDNBSS : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 12: 



GTAGATAAAT CCAAATCTAC AGCCTATCTG CAAATG 36 



<2) INFORMATION FOR SBQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



GTAGATAAAT CCTCTTCTAC AGCCTATCTG CAAATG 36 



(2) INFORMATION FOR SBQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6B bases 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CTTATAAAGG TGTTTCCACC TATAACCAGA AATTCAAGGA TCGTTTCACG 50 



ATATCCGTAG ATAAATCC 68 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 15: 



CTATACCTCC CGTCTGCATT CTGGAGTCCC 30 



<2) INFORMATION FOR SBQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 16 



Asp He Gin Met Thr Gin Thr Thr Ser Ser Leu Ser Ala Ser Leu 
15 10 15 

Gly Asp Arg Val Thr He Ser Cys Arg Ala Ser Gin Asp He Arg 

20 25 30 

Asn Tyr Leu Asn Trp Tyr Gin Gin Lye Pro Asp Gly Thr Val Lys 

35 40 45 

Leu Leu He Tyr Tyr Thr Ser Arg Leu His Ser Gly Val Pro Ser 

50 55 60 

Lys Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr He 

65 70 75 

Ser Asn Leu Glu Gin Glu Asp He Ala Thr Tyr Phe Cys Gin Gin 

80 85 90 

Gly Asn Thr Leu Pro Trp Thr Phe Ala Gly Gly Thr Lys Leu Glu 

95 100 105 

He Lys 
107 

40 (2) INFORMATION FOR SBQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 17: 

Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
1 5 10 15 

Gly Asp Arg Val Thr He Thr Cys Arg Ala Ser Gin Asp He Arg 

20 25 30 
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Asn Tyx Leu Asn Trp Tyx Gin Gin Lys Pro Gly Lye Ala Pro Lys 

35 40 45 

Leu Leu lie Tyx Tyx Thr Ser Arg Leu Glu 8er Gly Val Pro Ser 

50 55 60 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr He 

65 70 75 

Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr- Cys Gin Gin 

60 05 90 

Gly Asn Thr Leu Pro Trp Thr Phe Gly Gin Gly Thr Lys Val Glu 

95 100 105 

He Lys 

107 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 

X 5 10 15 

Gly Asp Arg Val Thr He Thr Cys Arg Ala Ser Gin Ser He Ser 

20 25 30 

Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys 

35 40 45 

Leu Leu He Tyr Ala Ala Ser Ser Leu Glu Ser Gly Val Pro Ser 

50 55 60 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He 

65 70 75 

Ser Ser Leu Gin Fro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin 

B0 85 90 

Tyr Asn Ser Leu Pro Trp Thr Phe Gly Gin Gly Thr Lys Val Glu 

95 100 105 

He Lys 
107 
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(2) INFORMATION FOR SBQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 19: 

Qlu Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly 
15 10 15 

Ala Ser Met Lys lie Ser Cys Lys Ala Ser Gly Tyr Ser Phe Thr 

20 25 30 

Gly Tyr Thr Met Asn Trp Val Lys Gin Ser Hie Gly Lys Asn Leu 

35 40 45 

Glu Trp Met Gly Leu lie Asn Pro Tyr Lys Gly Val Ser Thr Tyr 

50 55 60 

Asn Gin Lys Phe Lys Asp Arg Phe Thr lie Ser Lys Ala Thr Leu 

65 70 75 

Thr Val Asp Lys Ser Ser Ser Thr Ala Tyr Leu Met Glu Leu Leu 

80 65 90 

Asn Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ala Arg 

95 100 105 

Ser Gly Tyr Tyr Gly Asp Ser Asp Trp Tyr Phe Asp Val Trp Gly 

110 115 120 

Ala Gly Thr Thr Val Thr Val Ser Ser 

125 129 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:20: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Fro Gly 
15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Thr 

20 25 30 

Gly Tyr Thr Met Asn Tip Val Arg Gin Ala Pro Gly Lys Gly Leu 

35 40 45 
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Glu Trp Val Ala Leu He Asn Pro Tyx Lye Gly Val Ser Thr Tyr 

50 55 60 

Asn Gin Lye Phe Lys Asp Arg Phe Thr lie Ser Val Asp Lye Ser 

65 70 75 

Lys Asn Thr Ala Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

80 B5 90 

Thr Ala Val Tyr Tyr Cys Ala Arg Ser Gly Tyr Tyr Gly Asp Ser 

95 100 105 

Asp Trp Tyr Phe Asp Val Trp Gly Gin Gly Thr Leu Val Thr Val 
75 110 115 120 



10 



20 



25 



30 



40 



45 



50 



122 

(2) INFORMATION FOR SEQ ID NO: 21: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 21: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 
15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser 

20 25 30 

Ser Tyr Ala Met Ser Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 

35 35 40 45 

Glu Trp Val Ser Val He Ser Gly Asp Gly Gly Ser Thr Tyr Tyr 

50 55 60 



Ala Asp Ser Val Lye Gly Arg Phe Thr He Ser Arg Asp Asn Ser 

65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

B0 85 90 

Thr Ala Val Tyr Tyr Cys Ala Arg Gly Arg Val Gly Tyr Ser Leu 

95 100 105 

Ser Gly Leu Tyr Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val 

110 115 120 

Ser Ser 
122 



55 



54 



EP 0 940 468 A1 



(2) INFORMATION FOR SBQ ID NO: 22: 

m 

(i) 



(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 22: 

Gin Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly 
1 5 .10 15 

Ala Ser Val Lys lie Ser Cys Lys Thr Ser Gly Tyr Thr Phe Thr 

20 25 30 

* 

Glu Tyr Thr Met His Trp Met Lys Gin Ser His Gly Lys Ser Leu 

35 40 45 

Glu Trp lie Gly Gly Phe Asn Pro Lys Asn Gly Gly Ser Ser His 

50 55 €0 

Asn Gin Arg Phe Met Asp Lys Ala Thr Leu Ala Val Asp Lys Ser 

65 70 75 

Thr Ser Thr Ala Tyr Met Glu Leu Arg Ser Leu Thr Ser Glu Asp 

80 65 90 

Ser Gly lie Tyr Tyr Cys Ala Arg Trp Arg Gly Leu Asn Tyr Gly 

95 100 105 

Phe Asp Val Arg Tyr Phe Asp Val Trp Gly Ala Gly Thr Thr Val 

110 115 120 

Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu 

125 130 135 

Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly 

140 145 150 

Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp 

160 165 



Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val 

170 175 180 

Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Tnr Val 

185 190 195 

Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr lie Cys Asn Val Asn 

200 205 210 

His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys 

215 220 225 
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Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu 

230 235 240 

Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys 

245 250 



Asp Thr Leu Met He Ser Arg Thr Pro Glu Val Thr Cys Val Val 

260 265 270 

Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr 

275 280 285 

Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu 

290 295 300 

Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val 

305 310 315 

Leu His Gin Asp Trp lieu Asn Gly Lys Glu Tyr Lys Cys Lys Val 

320 325 330 

Ser Asn Lys Ala Leu Pro Ala Pro He Glu Lys Thr Zle Ser Lys 

340 345 



Ala Lys Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro 

350 355 360 

Ser Arg Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr Cys Leu 

365 370 375 

Val Lys Gly Phe Tyr Pro Ser Asp He Ala Val Glu Trp Glu Ser 

380 385 390 

i . 

Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu 

400 405 



Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp 

410 415 420 

Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys Ser Val Met 

425 430 



His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu 

440 445 450 

Ser Pro Gly Lys 

454 

(2) INFORMATION FOR SBQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 amino 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION : SBQ ID 110:23: 

His His Gin Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys 
15 10 15 

Pro Gly Ala Ser Val Lys He Ser Cys Lys Thr Ser Gly Tyr Thr 

20 25 30 

Phe Thr Glu Met Gly Trp Ser Cys He He Leu Phe Leu Val Ala 

35 40 45 

Thr Ala Thr Gly Val His Ser Glu Val Gin Leu Val Glu Ser Gly 

50 55 60 

Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala 

65 70 75 

Thr Ser Gly Tyr Thr Phe Thr Glu Tyr Thr Met His Trp Met Arg 

80 85 90 

Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Gly He Asn Pro 

95 100 105 

Lys Asn Gly Gly Thr Ser His Asn Gin Arg Phe Met Asp Arg Phe 

110 115 120 

Thr He Ser Val Asp Lys Ser Thr Ser Thr Ala Tyr Met Gin Met 

130 135 



Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg 

140 145 150 

Trp Arg Gly Leu Asn Tyr Gly Phe Asp Val Arg Tyr Phe Asp Val 

160 165 



Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys 

170 175 180 

Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser 

185 190 195 

Glu Ser Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro 

200 205 210 

Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly 

215 220 



Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 

230 235 240 

Leu Ser Ser Val Val Thr Val Thr Ser Ser Asn Phe Gly Thr Gin 

245 250 
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Tyr Thr Cys Asn VaX Asp His Lys Pro 

260 



Ser Aan Thr Lye Vol 

270 



Asp Lys Thr Val Glu Arg Lys Cys Cys Val Thr Cys Pro 

275 280 



Pro Cys 
285 



Pro Ala Pro Glu Leu Leu Gly Gly Pro 

290 



Ser Val Phe Leu Phe Pro 

300 



Pro Lys Pro 



Lys Asp 
305 



Leu Met lie Ser 

310 



Arg Thr. Pro Glu Val 



Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys 

320 325 330 



Glu Cys Pro Pro Cys Pro Ala Pro 



Pro Val Ala Gly 

340 



Pro Ser Val 



Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met He Ser Arg 

350 355 360 



Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp 

370 375 



Pro Glu Val Gin Phe Asn Trp Tyr Val Asp Gly Met Glu Val His 

380 385 390 



Asn Ala Lys Thr Lys Pro 

395 



Glu Glu Gin Phe Asn Ser Thr Phe 
400 405 



Arg Val Val Ser Val Leu Thr Val Val His Gin Asp Trp Leu Asn 

410 415 420 

Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ala 

425 430 435 



Glu Lys Thr He 
440 



Lys Thr Lys Gly Gin Pro 
445 



Glu 
450 



Pro Gin Val Tyr Thr Leu Pro Pro 



Asn Gin Val Ser 



Ser Arg Glu Glu Met Thr Lys 
460 465 



Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro 
470 475 



480 



Asp He Ala Val Glu Trp Glu Ser 

485 



Asn Gly Gin Pro Glu Asn Asn 

490 495 



Tyr Lys Thr Thr Pro Pro Met Leu Asp Ser Asp Gly Ser Phe Phe 

500 505 510 



Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly 

520 
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Asn Val Phe Ser Cys Ser Val Met Hie Glu Ala Leu His Asn His 

530 535 540 

Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 

545 550 555 

(2) INFORMATION FOR SBQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Asp Val Gin Met Thr Gin Thr Thr Ser Ser Leu Ser Ala Ser Leu 
15 10 15 

Gly Asp Arg Val Thr lie Asn Cys Arg Ala Ser Gin Asp lie Asn 

20 25 30 

Asn Tyr Leu Asn Trp Tyr Gin Gin Lys Pro Asn Gly Thr Val Lys 

35 40 45 

Leu Leu lie Tyr Tyr Thr Ser Thr Leu His Ser Gly Val Pro Ser 

50 55 60 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr He 

65 70 75 

Ser Asn Leu Asp Gin Glu Asp He Ala Thr Tyr Phe Cys Gin Gin 

80 85 90 

Gly Asn Thr Leu Pro Pro Thr Phe Gly Gly Gly Thr Lys Val Glu 

95 100 105 

■ 

He Lys Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro 

110 115 120 

Ser Asp Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu 

125 130 135 

Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val 

140 145 150 

Asp Asn Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu 

160 165 



Gin Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr 

170 175 180 

Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu 

185 190 195 
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w 



15 



30 



35 



40 



45 



50 



Val Thr His Gin Oly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn 

200 205 210 

Arg Gly Glu Cys 

214 

(2) INFORMATION FOR S5Q ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 25: 

Met Gly Trp Ser Cys lie lie Leu Phe Leu Val Ala Thr Ala Thr 
15 10 



Gly Val His Ser Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu 

20 20 25 30 

Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr Cys Arg Ala Ser 

35 40 45 

25 Gin Asp He Asn Asn Tyr Leu Asn Trp Tyr Gin Gin Lys Pro Gly 

50 55 60 

Lys Ala Pro Lys Leu Leu He Tyr Tyr Thr Ser Thr Leu His Ser 

65 70 75 



Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr 

60 85 90 

Thr I^eu Thr He Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr 

95 100 105 

Tyr Cys Gin Gin Gly Asn Thr Leu Pro Pro Thr Phe Gly Gin Gly 

110 115 120 



Thr Lys Val Glu He Lys Arg Thr Val Ala Ala Pro Ser Val Phe 

125 130 



He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser Gly Thr Ala Ser 

140 145 150 

Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val 

160 165 



Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly Asn Ser Gin Glu 

170 175 180 

Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser 

185 190 195 
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Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val 

200 205 210 

Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser Pro Val Thr 

215 220 225 

Lys Ser Phe Asa Arg Gly Glu Cys 

230 233 



Claims 

1 . A humanized antibody variable domain having a non-human Complementarity Determining Region (CDR) incorpo- 
rated into a human antibody variable domain and further comprising an amino acid substitution at a site selected 
from the group consisting of: 4L, 35L, 36L, 38L, 43L, 44L, 46L, 58L, 62L, 64L, 65L, 66U 67L, 68L, 69L, 70L, 71L, 
73L, 85L, 87L. 98U 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 92H, 
and 93H. 

2. The humanized antibody variable domain of claim 1 , wherein the substituted residue is the residue found at the cor- 
responding location of the non-human antibody from which the non-human CDR was obtained. 

3. The humanized antibody variable domain of claim 1 , wherein no human Framework Region (FR) residue other than 
those set forth in the group has been substituted. 
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Anneal huV f or huV H oligomers to pAKl template 




1. Ligate 

2. Isolate assembled oligomers 

3. Anneal to pAKl template (Xholz Stul+) 

4. Extend and ligate 




1. Transform E. coli 

2. Isolate phagemid pool 

3. Enrich for hu\^ and huV H (Xho I + StuI') 

4. Sequence verify 




FIG. 2 
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